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HYBRID ADENOVIRUS-AAV VIRUS AND METHODS OF USE THEREOF 

This invention was supported by the National 
Institute of Health Grant No. P30 DK 47757. The United 
5 States government has rights in this invention. 

yj^id of the invention 

The present invention relates to the field of 
vectors useful in somatic gene therapy and the production 
10 thereof . 

ftfWK*r» Qlind Qf * he Invention 

Recombinant adenoviruses are capable of 
providing extremely high levels of transgene delivery to 

15 virtually all cell types, regardless of the mitotic 
state. High titers (10 13 plaque forming units/ml) of 
recombinant virus can be easily generated in 293 cells 
(the adenovirus equivalent to retrovirus packaging cell 
lines) and cryo-stored for extended periods without 

20 appreciable losses. 

The primary limitation of this virus as a 
vector resides in the complexity of the adenovirus 
genome. A human adenovirus is comprised of a linear, 
approximately 36 kb double-stranded DNA genome, which is 

25 divided into 100 map units (m.u.), each of which is 360 
bp in length. The DNA contains short inverted terminal 
repeats (ITR) at each end of the genome that are required 
for viral DNA replication. The gene products are 
organized into early (El through E4) and late (LI through 

30 L5) regions, based on expression before or after the 

initiation of viral DNA synthesis [see, e.g., Horwitz, 
virology . 2d edit., ed. B. N. Fields, Raven Press, Ltd. , 
New York (1990)]. 

A human adenovirus undergoes a highly regulated 

35 program during its normal viral life cycle [Y. Yang et 
al, Proc. Natl - Sci.. USA. 21:4407-4411 (1994)]. 
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Virions are internalized by receptor-mediated endocytosis 
and transported to the nucleus where the immediate early 
genes, Ela and Elb, are expressed. Because these early 
gene products regulate expression of a variety of host 
5 genes (which prime the cell for >rirus production) and are 
central to the cascade activation of early delayed genes 
(e.g. E2, E3, and E4) followed by late genes (e.g. Ll-5) , 
first generation recombinant adenoviruses for gene 
therapy focused on the removal of the El domain. This 
10 strategy was successful in rendering the vectors 

replication defective, however, in vivo studies revealed 
transgene expression was transient and invariably 
associated with the development of severe inflammation at 
the site of vector targeting [S. Ishibashi et al, i. 
15 run, invest .. 22:1885-1893 (1994); J. M. Wilson et al, 
P ror. tel.. USA. £5:4421-4424 (1988); J. M. 

Wilson et al, rMn. Bio.. 2:21-26 (1991); M. Grossman et 
al, ?™ r~M. and Mol. Gen., 12:601-607 (1991)]. 

Adeno-associated viruses (AAV) have also been 
20 employed as vectors. AAV is a small, single-stranded 
(ss) DNA virus with a simple genomic organization (4.7 
kb) that makes it an ideal substrate for genetic 
engineering. Two open reading frames encode a series of 
rep and cap polypeptides. Rep polypeptides (rep78, 
25 rep68, rep62 and rep40) are involved in replication, 
rescue and integration of the AAV genome. The cap 
proteins (VP1, VP2 and VP3) form the virion capsid. 
Flanking the rep and cap open reading frames at the 5' 
and 3' ends are 145 bp inverted terminal repeats (ITRs) , 
30 the first 125 bp of which are capable of forming Y- or T- 
shaped duplex structures. Of importance for the 
d velopment of AAV vectors, the entire r p and cap 
domains can be excised and replac d with a therapeutic or 
reporter transg ne [B. J. Cart r, in "Handbook of 
35 Parvovirus s", ed. , P. Tijss r, CRC Press, pp.155-168 
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(1990)]. It has been shown that the ITRs represent the 
minimal sequence required for replication, rescue, 
packaging, and integration of the AAV genome. 

The AAV life cycle is biphasic, composed of 
5 both latent and lytic episodes. During a latent 

infection, AAV virions enter a cell as an encapsidated 
ssDNA, and shortly thereafter are delivered to the 
nucleus where the AAV DNA stably integrates into a host 
chromosome without the apparent need for host cell 

10 division. In the absence of helper virus, the integrated 
ss DNA AAV genome remains latent but capable of being 
activated and rescued. The lytic phase of the life cycle 
begins when a cell harboring an AAV provirus is 
challenged with a secondary infection by a herpesvirus or 

15 adenovirus which encodes helper functions that are 
recruited by AAV to aid in its excision from host 
chromatin [B. J. Carter, cited above]. The infecting 
parental ssDNA is expanded to duplex replicating form 
(RF) DNAs in a rep dependent manner. The rescued AAV 

20 genomes are packaged into preformed protein caps ids 

(icosahedral symmetry approximately 20 nm in diameter) 
and released as infectious virions that have packaged 
either + or - ss DNA genomes following cell lysis. 

Progress towards establishing AAV as a 

25 transducing vector for gene therapy has been slow for a 
variety of reasons. While the ability of AAV to 
integrate in quiescent cells is important in terms of 
long term expression of a potential transducing gene, the 
tendency of the integrated provirus to preferentially 

30 target only specific sites in chromosome 19 reduces its 
usefulness. Additionally, difficulties surround large- 
scale production of replication defectiv recombinants. 
In contrast to the production of recombinant retrovirus 
or adenovirus, the only widely recognized means for 

35 manufacturing transducing AAV virions entails co- 
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transfection with two different, yet complementing 
plasmids. One of these contains the therapeutic or 
reporter minigene sandwiched between the two cis acting 
AAV ITRs. The AAV components that are needed for rescue 
5 and subsequent packaging of progeny recombinant genomes 
are provided in trans by a second plasmid encoding the 
viral open reading frames for rep and cap proteins. The 
cells targeted for transfection must also be infected 
with adenovirus thus providing the necessary helper 

10 functions. Because the yield of recombinant AAV is 
dependent on the number of cells that are transfected 
with the cis and trans-acting plasmids, it is desirable 
to use a transfection protocol with high efficiency. For 
large-scale production of high titer virus, however, 

15 previously employed high efficiency calcium phosphate and 
liposome systems are cumbersome and subject to 
inconsistencies . 

There remains a need in the art for the 
development of vectors which overcome the disadvantages 

20 of the known vector systems. 

c nW?irY ftf thft Tnvention 

In one aspect, the present invention provides a 
unique recombinant hybrid adenovirus/ AAV virus, which 

25 comprises an adenovirus capsid containing selected 

portions of an adenovirus sequence, 5« and 3' AAV ITR 
sequences which flank a selected transgene under the 
control of a selected promoter and other conventional 
vector regulatory components. This hybrid virus is 

30 characterized by high titer transgene delivery to a host 
cell and the ability to stably integrate the transgene 
into the host cell chromosome in the presence of the rep 
gene. In one embodiment, th transgene is a reporter 
gene. Another embodim nt of the hybrid virus contains a 

35 therapeutic transgene. In a pr f erred embodiment, the 



WOW/13598 PCT/US95/14018 



hybrid virus has associated therewith a polycation 
sequence and the AAV rep gene. This construct is termed 
the hybrid virus conjugate or trans-infection particle. 
In another aspect, the present invention 
5 provides a hybrid vector construct for use in producing 
the hybrid virus or viral particle described above. This 
hybrid vector comprises selected portions of an 
adenovirus sequence, 5» and 3 1 AAV ITR sequences which 
flank a selected transgene under the control of a 

10 selected promoter and other conventional vector 
regulatory components. 

In another aspect, the invention provides a 
composition comprising a hybrid viral particle for use in 
delivering a selected gene to a host cell. Such a 

15 composition may be employed to deliver a therapeutic gene 
to a targeted host cell to treat or correct a genetically 
associated disorder or disease. 

In yet another aspect, the present invention 
provides a method for producing the hybrid virus by 

20 transfecting a suitable packaging cell line with the 
hybrid vector construct of this invention. In another 
embodiment the method involves co-transfecting a cell 
line (either a packaging cell line or a non-packaging 
cell line) with a hybrid vector construct and a suitable 

25 helper virus. 

In a further aspect, the present invention 
provides a method for producing large quantities of 
recombinant AAV particles with high efficiency by 
employing the above methods, employing the hybrid vector 

30 construct of this invention and collecting the rAAV 

particles from a packaging cell line transfected with the 
vector . 

Other aspects and advantages of the pres nt 
invention are d scribed further in the following detailed 
35 d scription of the preferred mbodiments thereof. 
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i> r jof p f script ion of the Drawings 

Fig. 1A is a schematic diagram of a vector 
construct pAd.AV.CMVLacZ [SEQ ID NO: l], which contains 
(from the top in clocJcwise order) adenovirus sequence map 
units 0-1 (clear bar); the 5* AA/ ITR (solid bar); a CMV 
immediate early enhancer /promoter (hatched arrow) , an 
SV40 intron (clear bar), an E. coli beta-galactosidase 
cDNA (LacZ) (hatched line) , an SV40 polyadenylation 
signal (clear bar), a 3- AAV ITR (solid bar), adenovirus 
sequence from map units 9-16 (clear bar) , and a portion 
of a pBR322 derivative plasmid (thin solid line) . 
Restriction endonuclease enzymes are identified by their 
conventional designations; and the location of each 
restriction enzyme is identification by the nucleotide 
15 number in parentheses to the right of the enzyme 

designation. 

Fig. IB is a schematic drawing demonstrating 
linearization of pAd.AV.CMVLacZ [SEQ ID NO: 1] by 
digestion with restriction enzyme Nhel and a linear 
20 arrangement of a Clal digested adenovirus type 5 with 
deletions from mu 0-1. The area where homologous 
recombination will occur (between m.u. 9-16) in both the 
plasmid and adenovirus sequences is indicated by crossed 
lines . 

25 Fig. 1C is a schematic drawing which 

demonstrates the hybrid virus Ad.AV.CMVLacZ after co- 
transfection of the linearized pAd.AV.CMVXacZ [SEQ ID NO: 
1] and adenovirus into 293 cells followed by 
intracellular homologous recombination. 

Fig. 2A-2K report the top DNA strand of the 
double-strand plasmid pAd.AV.CMVLacZ [SEQ ID NO: 1] (the 
compl mentary strand can be readily d rived by one of 
skill in the art) . With reference to SEQ ID NO: 1, 
nucleotides 1-365 ar ad novirus type 5 s quences; the 5' 
35 AAV ITR sequenc spans nucleotides 366-538; the CMV 
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promoter/ enhancer spans nucleotides 563-1157; the SV-40 
intron spans nucleotides 1158-1179; the LacZ gene spans 
nucleotides 1356-4827; the SV-40 poly A sequence spans 
nucleotides 4839-5037; the 3" AAV ITR spans nucleotides 
5 5053 to 5221; nucleotides 5221 vo about 8100 are 

adenovirus type 5 sequences. The remaining sequences are 
non-specif ic/plasmid sequences. 

Fig. 3 is a bar graph plotting u.v. absorbance 
at 420 nm of the beta-galactosidase blue color for a 
10 control and ten putative positive clones (D1A through 

D1J) of 293 cells transfected with the recombinant hybrid 
Ad.AV.CMVLacZ. Eight of the clones expressed high levels 
of enzyme. 

Fig. 4 is a schematic diagram of pRep78/52 [SEQ 
15 ID NO: 2]. This plasmid includes an AAV P5 promoter , 
Rep78, Rep52 and a poly-A sequence in a pUC18 plasmid 
background. 

Figs. 5A - 5E report nucleotides 1-4910 of the 
top DNA strand of the double-strand plasmid pRep78/52 
20 [SEQ ID NO: 2] (the complementary strand can be readily 
derived by one of skill in the art) . 

Fig. 6 is a flow diagram of the construction of 
a trans-infection particle formed by a hybrid virus, a 
poly-L- lysine sequence and attached AAV rep-containing 
25 plasmid. 

Fig. 7 is a flow diagram of the hybrid virus 1 
life cycle, in which a trans-infection particle enters 
the cell and is transported to the nucleus. The virus is 
uncoated and the rep mediates rescue of the inserted 
30 gene, which is then integrated into the chromosome of the 
host cell. 
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I. construction of the Hybrid Vector and Virus 

A. The Adenov irus Component of the Vector and 

Virus 

The hybrid virus of this invention uses 
5 adenovirus nucleic acid sequences as a shuttle to deliver 
a recombinant AAV/transgene genome to a target cell. The 
DMA sequences of a number of adenovirus types, including 
type Ad5, are available from Genbank. The adenovirus 
sequences may be obtained from any known adenovirus type, 

10 including the presently identified 41 human types 

[Horwitz et al, cited above]. Similarly adenoviruses 
known to infect other animals may also be employed in the 
vector constructs of this invention. The selection of 
the adenovirus type is not anticipated to limit the 

15 following invention. A variety of adenovirus strains are 
available from the American Type Culture Collection, 
Rockville, Maryland, or available by request from a 
variety of commercial and institutional sources. In the 
following exemplary embodiment an adenovirus, type 5 

20 (Ad5) is used for convenience. 

The adenovirus nucleic acid sequences 
employed in the hybrid vector of this invention can range 
from a minimum sequence amount, which requires the use of 
a helper virus to produce the hybrid virus particle, to 

25 only selected deletions of adenovirus genes, which 

deleted gene products can be supplied in the hybrid viral 
production process by a selected packaging cell. 
Specifically, at a minimum, the adenovirus nucleic acid 
sequences employed in the pAdA shuttle vector of this 

30 invention are adenovirus genomic sequences from which all 
viral genes are deleted and which contain only those 
adenovirus s quences requir d for packaging adenoviral 
genomic DNA into a pref rmed capsid head. More 
specifically, the adenovirus s quenc s employed are the 

35 cis-acting 5' and 3' inverted t rminal repeat (ITR) 
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sequences of an adenovirus (which function as origins of 
replication) and the native 5« packaging/ enhancer domain, 
that contains sequences necessary for packaging linear Ad 
genomes and enhancer elements for the El promoter. 
5 According to this invention, the entire adenovirus 5' 
sequence containing the 5' ITR and packaging/enhancer 
region can be employed as the 5 ■ adenovirus sequence in 
the hybrid virus. This left terminal (5') sequence of 
the Ad5 genome useful in this invention spans bp 1 to 
10 about 360 of the conventional adenovirus genome, also 
referred to as map units 0-1 of the viral genome, and 
generally is from about 353 to about 360 nucleotides in 
length. This sequence includes the 5' ITR (bp 1-103 of 
the adenovirus genome) ; and the packaging/enhancer domain 
15 (bp 194-358 of the adenovirus genome) . Preferably, this 
native adenovirus 5« region is employed in the hybrid 
virus and vector in unmodified form. Alternatively, 
corresponding sequences from other adenovirus types may 
be substituted. These Ad sequences may be modified to 
20 contain desired deletions, substitutions, or mutations, 
provided that the desired function is not eliminated. 

The 3' adenovirus sequences of the hybrid virus 
include the right terminal (3«) ITR sequence of the 
adenoviral genome spanning about bp 35,353 - end of the 
25 adenovirus genome, or map units -98.4-100. This sequence 
is generally about 580 nucleotide in length. This entire 
sequence is desirably employed as the 3' sequence of a 
hybrid virus. Preferably, the native adenovirus 3« 
region is employed in the hybrid virus in unmodified 
30 form. However, as described above with respect to the 5' 
sequences, some modifications to these sequences which do 
not adversely eff ct their biological function may be 
acceptable. As described below, wh n these 5' and 3- 
adenovirus sequences ar employed in the hybrid vector, a 
35 helper adenovirus which supplies all other essential 
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genes for viral formation alone or with a packaging cell 
line is required in the production of the hybrid virus or 
viral particle. 

Alternative embodiments of the hybrid 
5 virus employ adenovirus sequences in addition to the 

minimum sequences, but which contain deletions of all or 
portions of adenovirus genes. For example, the 
adenovirus immediate early gene Ela (which spans mu 1.3 
to 4.5) and delayed early gene Elb (which spans mu 4.6 to 

10 11.2) should be deleted from the adenovirus sequence 
which forms a part of the hybrid vector construct and 
virus. Alternatively, if these sequences are not 
completely eliminated, at least a sufficient portion of 
the Ela and Elb sequences must be deleted so as to render 

15 the virus replication defective. These deletions, 
whether complete or partial, which eliminate the 
biological function of the gene are termed "functional 
deletions 11 herein. 

Additionally, all or a portion of the 

20 adenovirus delayed early gene E3 (which spans mu 76.6 to 
86.2) may be eliminated from the adenovirus sequence 
which forms a part of the hybrid virus. The function of 
E3 is irrelevant to the function and production of the 
hybrid virus. 

25 All or a portion of the adenovirus delayed 

early gene E2a (which spans mu 67.9 to 61.5) may be 
eliminated from the hybrid virus. It is also anticipated 
that portions of the other delayed early genes E2b (which 
spans mu 29 to 14.2) and E4 (which spans mu 96.8 to 91.3) 

30 may also be eliminated from the hybrid virus and from the 
vector. 

Deletions may also be made in any of the 
late genes LI through L5, which span mu 16.45 to 99 of 
the adenovirus g nome. Similarly, deletions may be 
35 useful in the intermediate genes IX which maps between mu 
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9.8 and 11.2 and IVa 2 which maps between 16.1 to 11.1. 
Other deletions may occur in the other structural or non- 
structural adenovirus. 

The above discussed deletions may occur 

5 individually, i.e., an adenovirub sequence for use in the 
present invention may contain deletions of El only. 
Alternatively, deletions of entire genes or portions 
effective to destroy their biological activity may occur 
in any combination. Por example, in one exemplary hybrid 

10 vector, the adenovirus sequence may contain deletions of 
the El genes and the E3 gene, or of the El, E2a and E3 
genes, or of the El and E4 genes, or of El, E2a and E4 
genes, with or without deletion of E3, and so on. 

The more deletions in the adenovirus 

15 sequence up to the minimum sequences identified above 
that characterize the hybrid virus, the larger the 
sequence (s) of the other below-described components to be 
inserted in the hybrid vector. As described above for 
the minimum adenovirus sequences, those gene sequences 

20 not present in the adenovirus portion of the hybrid virus 
must be supplied by either a packaging cell line and/ or a 
helper adenovirus to generate the hybrid virus. 

In an exemplary hybrid virus of this invention 
which is described below and in Example 1, the adenovirus 

25 genomic sequences present are from mu 0 to 1, mu 9 to 
78.3 and mu 86 to 100 (deleted sequences eliminate the 
Ela and Elb genes and a portion of the E3 gene) . From 
the foregoing information, it is expected that one of 
skill in the art may construct hybrid vectors and viruses 

30 containing more or less of the adenovirus gene sequence. 

The portions of the adenovirus genome in 
the hybrid virus permit high production titers of the 
virus to be produced, often greater than 1x10 13 pfu/ml. 
This is in stark contrast to the low titers (1x10* 

35 pfu/ml) that have been found for recombinant AAV. 



* ! 

WO 96/13598 



PCT/US95/14018 



13 



10 



B. AAV co m ponents of the Vector flfltf Virus 

Also part of the hybrid vectors and 
viruses of this invention are sequences of an adeno- 
associated virus. The AAV sequences useful in the hybrid 
vector are the viral sequences *rom which the rep and cap 
polypeptide encoding sequences are deleted. More 
specifically, the AAV sequences employed are the cis- 
acting 5' and 3' inverted terminal repeat (ITR) sequences 
[See, e.g., B. J. Carter, in "Handbook of Parvoviruses", 
ed., P. Tijsser, CRC Press, pp. 155-168 (1990)]. As 
stated above, the ITR sequences are about 143 bp in 
length. Substantially the entire sequences encoding the 
ITRs are used in the vectors, although some degree of 
minor modification of these sequences is expected to be 
15 permissible for this use. See, e.g., WO 93/24641, 

published December 9, 1993. The ability to modify these 
ITR sequences is within the skill of the art. For 
suitable techniques, see, e.g., texts such as Sambrook et 
al, "Molecular Cloning. A Laboratory Manual.", 2d edit., 
20 Cold Spring Harbor Laboratory, New York (1989). 

The AAV ITR sequences may be obtained from 
any known AAV, including presently identified human AAV 
types. Similarly, AAVs known to infect other animals may 
also be employed in the vector constructs of this 
25 invention. The selection of the AAV is not anticipated 
to limit the following invention. A variety of AAV 
strains, types 1-4, are available from the American Type 
Culture Collection or available by request from a variety 
of commercial and institutional sources. In the 
30 following exemplary embodiment an AAV-2 is used for 
convenience . 

In the hybrid vector construct, the AAV 
sequ nces ar flanked by the sel cted adenovirus 
sequences discussed above. The 5« and 3' AAV ITR 
35 sequ nces themselves flank a selected transgene sequence 
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and associated regulatory elements, described below. 
Thus, the sequence formed by the transgene and flanking 
5« and 3' AAV sequences may be inserted at any deletion 
site in the adenovirus sequences of the vector. For 
5 example, the AAV sequences are desirably inserted at the 
site of the deleted Ela/Elb genes of the adenovirus, 
i.e., after map unit 1. Alternatively, the AAV sequences 
may be inserted at an E3 deletion, E2a deletion, and so 
on. If only the adenovirus 5« ITR/packaging sequences 
10 and 3' ITR sequences are used in the hybrid virus, the 
AAV sequences are inserted between them. 

C. The Trans a ene Component of the flvbrid 

y^*-°r » nd Virus 

The transgene sequence of the vector and 

15 recombinant virus is a nucleic acid sequence or reverse 
transcript thereof, heterologous to the adenovirus 
sequence, which encodes a polypeptide or protein of 
interest. The transgene is operatively linked to 
regulatory components in a manner which permits transgene 

20 transcription. 

The composition of the transgene sequence 
will depend upon the use to which the resulting hybrid 
vector will be put. For example, one type of transgene 
sequence includes a reporter sequence, which upon 

25 expression produces a detectable signal. Such reporter 
sequences include without limitation an E. coli beta- 
galactosidase (LacZ) cDNA, an alkaline phosphatase gene 
and a green fluorescent protein gene. These sequences, 
when associated with regulatory elements which drive 

30 their expression, provide signals detectable by 
conventional means, e.g., ultraviolet wavelength 
absorbance, visible color change, etc. 

Anoth r type of transgene sequence 
includes a therapeutic g ne which expresses a desired 

35 gene product in a host cell. These therapeutic genes or 
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nucleic acid sequences typically encode products for 
administration and expression in a patient in vivo or ex 
vivo to replace or correct an inherited or non-inherited 
genetic defect or treat an epigenetic disorder or 
5 disease. Such therapeutic genet* which are desirable for 
the performance of gene therapy include, without 
limitation, a normal cystic fibrosis transmembrane 
regulator (CPTR) gene, a low density lipoprotein (LDL) 
gene, and a number of genes which may be readily selected 
10 by one of skill in the art. The selection of the 

transgene is not considered to be a limitation of this 
invention, as such selection is within the knowledge of 

those skilled in the art. 

D. fr»mii*torv nemants of fhft Ffvftrlfl Vector 

15 In addition to the major elements 

identified above for the hybrid vector, i.e., the 
adenovirus sequences, AAV sequences and the transgene, 
the vector also includes conventional regulatory elements 
necessary to drive expression of the transgene in a cell 

20 transfected with the hybrid vector. Thus the vector 
contains a selected promoter which is linked to the 
transgene and located, with the transgene, between the 
AAV ITR sequences of the vector. 

Selection of the promoter is a routine 

25 matter and is not a limitation of the hybrid vector 

itself. Useful promoters may be constitutive promoters 
or regulated (inducible) promoters, which will enable 
control of the amount of the transgene to be expressed. 
For example, a desirable promoter is that of the 

30 cytomegalovirus immediate early promoter/ enhancer [see, 
e.g., Boshart et al, fifiU, 11:521-530 (1985)]. Other 
desirable promot rs include, without limitation, the Rous 
sarcoma virus LTR promoter/ enhancer and the chicken p- 
actin promoter. Still other promoter /enhancer sequences 
35 may be selected by one f skill in the art. 
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The vectors will also desirably contain 
nucleic acid sequences heterologous to the adenovirus 
sequences including sequences providing signals required 
for efficient polyadenylation of the transcript and 
5 introns with functional splice donor and acceptor sites. 
A common poly-A sequence which is employed in the 
exemplary vectors of this invention is that derived from 
the papovavirus SV-40. The poly-A sequence generally is 
inserted in the vector following the transgene sequences 

10 and before the 3' AAV ITR sequence. A common intron 

sequence is also derived from SV-40, and is referred to 
as the SV-40 T intron sequence. A hybrid vector of the 
present invention may also contain such an intron, 
desirably located between the promoter /enhancer sequence 

15 and the transgene. Selection of these and other common 
vector elements are conventional and many such sequences 
are available [see, e.g., Sambrook et al, and references 
cited therein] . The DNA sequences encoding such 
regulatory regions are provided in the plasmid sequence 

20 of Fig. 2 [SEQ ID NO: 1]. 

The combination of the transgene, 
promoter /enhancer, the other regulatory vector elements 
and the flanking 5« and 3* AAV ITRs are referred to as a 
"minigene" for ease of reference herein. As above 

25 stated, the minigene is located in the site of any 

selected adenovirus deletion in the hybrid virus. The 
size of this minigene depends upon the amount and number 
of adenovirus sequence deletions referred to above. Such 
a minigene may be about 8 kb in size in the exemplary 

30 virus deleted in the El and E3 genes, as described in the 
examples below. Alternatively, if only the minimum 
adenovirus sequenc s are employed in the virus, this 
minigene may be a size up to about 30 kb. Thus, this 
hybrid vector and vector p rmit a gr at deal of latitude 

35 in the selection of the various components of the 
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minigene, particularly the transgene, with regard to 
size. Provided with the teachings of this invention, the 
design of such a minigene can be made by resort to 
conventional techniques. 
5 E. Hybrid Vec tor Assembly and Production of 

Hybrid Virus 

The material from which the sequences used 
in the hybrid vector, helper viruses, if needed, and 
recombinant hybrid virus (or viral particle) are derived 

10 and the various vector components and sequences employed 
in the construction of the hybrid vectors of this 
invention are obtained from commercial or academic 
sources based on previously published and described 
materials. These materials may also be obtained from an 

15 individual patient or generated and selected using 

standard recombinant molecular cloning techniques known 
and practiced by those skilled in the art. Any 
modification of existing nucleic acid sequences forming 
the vectors and viruses, including sequence deletions, 

20 insertions, and other mutations are also generated using 
standard techniques, 

Assembly of the selected DNA sequences of 
the adenovirus, the AAV and the reporter genes or 
therapeutic genes and other vector elements into the 

25 hybrid vector and the use of the hybrid vector to produce 
a hybrid virus utilize conventional techniques, such as 
described in Example 1. Such techniques include 
conventional cloning techniques of cDNA such as those 
described in texts [Sambrook et al, cited above], use of 

30 overlapping oligonucleotide sequences of the adenovirus 
and AAV genomes, polymerase chain reaction, and any 
suitable method which provides the desired nucleotide 
sequence. Standard transf ction and co-transf ection 
techniques are employ d, e.g., CaP0 4 transf ection 

35 techniques using the complementation human embryonic 
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kidney (HEK) 293 cell line (a human kidney cell line 
containing a functional adenovirus Ela gene which 
provides a transacting Ela protein) . Other conventional 
methods employed in this invention include homologous 
5 recombination of the viral genomes, plaguing of viruses 
in agar overlay, methods of measuring signal generation, 
and the like. 

As described in detail in Example 1 below 
and with resort to Fig. 1, a unique hybrid virus of this 

10 invention is prepared which contains an El-deleted, 

partially E3 deleted, adenovirus sequence associated with 
a single copy of a recombinant AAV having deletions of 
its rep and cap genes and encoding a selected reporter 
transgene. Briefly, this exemplary hybrid virus was 

15 designed such that the AV.CMYLacZ sequence [SEQ ID NO: l] 
(a minigene containing a 5' AAV ITR, a CMV promoter, an 
SV-40 intron, a LacZ transgene, an SV-40 poly-A sequence 
and a 3' AAV ITR) was positioned in place of the 
adenovirus type 5 (Ad5) Ela/Elb genes, making the 

20 adenovirus vector replication defective. 

Because of the limited amount of 
adenovirus sequence present in the hybrid vectors of this 
invention, including the pAV.CHVXacS [SEQ ID NO: 1] 
above, a packaging cell line or a helper adenovirus or 

25 both may be necessary to provide sufficient adenovirus 

gene sequences necessary for a productive viral infection 
to generate the hybrid virus. 

Helper viruses useful in this invention 
contain selected adenovirus gene sequences not present in 

30 the hybrid vector construct or expressed by the cell line 
in which the hybrid vector is transfected. Optionally, 
such a helper virus may contain a second reporter 
minigene which enables s paration of th resulting hybrid 
virus and the helper virus upon purification. The 
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construction of desirable helper viruses is within the 
skill of the art. 

As one example, if the cell line employed 
to produce the recombinant virus is not a packaging cell 
5 line, and the hybrid vector contains only the minimum 
adenovirus sequences identified above, the helper virus 
may be a wild type Ad virus. Thus, the helper virus 
supplies the necessary adenovirus early genes El, E2a, E4 
and all remaining late, intermediate, structural and non- 
10 structural genes of the adenovirus genome. However, if, 
in this situation, the packaging cell line is 293, which 
supplies the El proteins, the helper virus need not 
contain the El gene. 

In another embodiment, when the hybrid 
15 construct is rendered replication defective by a 

functional deletion in El but contains no other deletions 
in Ad genes necessary for production of an infective 
viral particle, and the 293 cell line is employed, no 
helper virus is necessary for production of the hybrid 
20 virus. Additionally, all or a portion of the adenovirus 
delayed early gene E3 (which spans mu 76.6 to 86.2) may 
be eliminated from the helper virus useful in this 
invention because this gene product is not necessary for 
the formation of a functioning hybrid virus particle. 
25 It should be noted that one of skill in 

the art may design other helper viruses or develop other 
packaging cell lines to complement the adenovirus 
deletions in the vector construct and enable production 
of the hybrid virus particle, given this information. 
30 Therefore, this invention is not limited by the use or 
description of any particular helper virus or packaging 
c 11 line. 

Thus, as described in Figs. 1A through 1C, 
the circular plasmid pAd.AV.CMVLacZ [SEQ ID NO: 1] 
35 (containing the minigene and only adenovirus sequences 



If » 

WO 96/13598 



PCT/US95/14018 



20 



10 



from map unit 0 tc 1 and 9 to 16) was digested and co- 
transfected with a selected Ad5 helper virus (containing 
adenovirus sequences 9 to 78.4 and 86 to 100) into 293 
cells. Thus, the packaging cell line provides the El 
proteins and the helper virus provides all necessary 
adenovirus gene sequences subsequent to map unit 16. 
Homologous recombination occurs between the helper virus 
and the hybrid vector, resulting in the hybrid viral 
particle. Growth of this hybrid viral particle in 293 
cells has been closely monitored for greater than 20 
rounds of amplification with no indication of genome 
instability. Rescue and integration of the transgene 
from the hybrid virus into a host cell and further 
modifications of the vector are described below. The 
15 resulting hybrid virus Ad.AV.CMVLacZ combines the high 
titer potential of adenovirus with the integrating 
biology associated with AAV latency. 

6. wyhrirf Vir» « Polvcatlon Con1U<79t98 

Rep expression is required for rescue of 
the rAAV genome to occur. A preferred approach is to 
synthetically incorporate a plasmid permitting expression 
of rep into the hybrid particle. To do so, the hybrid 
viruses described above are further modified by resort to 
adenovirus-poly lysine conjugate technology. See, e.g., 
25 Wu et al, ft. mm. chem. . 16985-16987 (1989); and K. 
J. Fisher and J. M. Wilson, wioehem. J.. 222: 49 (April 
1, 1994), incorporated herein by reference. Using this 
technology, a hybrid virus as described above is modified 
by the addition of a poly-cation sequence distributed 
30 around the capsid of the hybrid viral particle. 

Preferably, the poly-cation is poly-lysine, which 
attaches around th negatively-charged virus to form an 
external positive charg . A plasmid containing the AAV 
rep gene (or a functional portion th reof ) under the 
35 control of a suitable promoter is then complexed directly 
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to the hybrid cap3id, resulting in a single viral 
particle containing the hybrid virus and an AAV rep gene. 
The negatively charged plasmid ONA binds with high 
affinity to the positively charged poly lysine. 
5 Essentially the techniques employed in constructing this 
hybrid virus conjugate or trans-infection particle are as 
described in detail in Example 3 below. 

An alternative embodiment of the hybrid 
vector and resulting viral particle is provided by 

10 altering the rep containing plasmid to also contain an 
AAV cap gene. This embodiment of the hybrid vector when 
in a host cell is thus able to produce a recombinant AAV 
particle, as discussed in more detail below. 

The plasmids employed in these embodiments 

15 contain conventional plasmid sequences, which place a 
selected AAV sequence, i.e., rep and/ or cap gene 
sequences, under the control of a selected promoter. In 
the example provided below, the exemplary plasmid is 
pRep78/52 [SEQ ID NO: 2], a trans-acting plasmid 

20 containing the AAV sequences that encode rep 78 kD and 52 
kD proteins under the control of the AAV P5 promoter. 
The plasmid also contains an SV40 polyadenylation signal. 
The DNA sequence of this plasmid is provided in Fig. 8 
(SEQ ID NO: 2]. 

25 In a similar manner and with resort to 

plasmid and vector sequences known to the art, analogous 
plasmids may be designed using both rep and cap genes, 
and different constitutive or regulated promoters, 
optional poly-A sequences and introns. 

30 The availability of materials to make 

these modified hybrid vectors and viruses and the AAV rep 
and/or cap containing vectors and the techniques involved 
in the assembly of th hybrid vector and rep and/ r cap 
containing plasmids ar convent i nal as described above. 

35 The assembly techniques for the trans-infection particle 
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employ the techniques described above for the hybrid 
vector and the techniques of Wu et al and Fisher et al, 
cited above. The use of this trans-infection particle 
including rescue and integration of the transgene into 
5 the host cell is described below. 

XI. Function of the Hybrid virus 

A. Thft Hybrid virus Infects n Target C?U 

Once the hybrid virus or trans-infection 

10 particle is constructed as discussed above, it is 

targeted to, and taken up by, a selected target cell. 
The selection of the target cell also depends upon the 
use of the hybrid virus, i.e., whether or not the 
transgene is to be replicated in vitro for production of 

15 a recombinant AAV particle, or ex vivo for production 
into a desired cell type for redelivery into a patient, 
or in vivo for delivery to a particular cell type or 
tissue. Target cells may therefor be any mammalian cell 
(preferably a human cell) . For example, in in vivo use, 

20 the hybrid virus can target to any cell type normally 
infected by adenovirus, depending upon the route of 
administration, i.e., it can target, without limitation, 
neurons, hepatocytes, epithelial cells and the like. 
Uptake of the hybrid virus by the cell is caused by the 

25 infective ability contributed to the vector by the 
adenovirus and AAV sequences. 

5. T hG Tansaene is Rescued, 

Once the hybrid Virus or trans-infection 
particle is taken up by a cell, the AAV ITR flanked 

30 transgene must be rescued from the parental adenovirus 
backbone. Rescue of the transgene is dependent upon 
supplying the infected cell with an AAV rep gene. Thus, 
efficacy of the hybrid virus can be measured in terms of 
rep mediated rescue f rAAV from the parental adenovirus 
35 template. 
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The rep genes can be supplied to the 
hybrid virus by several methods. One embodiment for 
providing rep proteins in trans was demonstrated with the 
exemplary hybrid virus Ad.AV.CMVLacZ by transfecting into 
5 the target monolayer of cells previously infected with 

the hybrid vector , a liposome enveloped plasmid pRep78/52 
[SEQ ID NO: 2] containing the genes encoding the AAV rep 
78 kDa and 52 kDa proteins under the control of the AAV 
P5 promoter. Rescue and amplification of a double- 
10 stranded AAV monomer and a double-stranded AAV dimer, 

each containing the LacZ transgene described above, was 
observed in 293 cells. This is described in detail in 
Example 2. 

The production of rep in trans can be 

15 modulated by the choice of promoter in the rep containing 
plasmid. If high levels of rep expression are important 
early for rescue of the recombinant AAV domain, a 
heterologous (non-adenovirus, non-AAV) promoter may be 
employed to drive expression of rep and eliminate the 

20 need for El proteins. Alternatively, the low levels of 
rep expression from P5 that occur in the absence of 
adenovirus El proteins may be sufficient to initiate 
rescue and optimal to drive integration of the 
recombinant AAV genome in a selected use. 

25 More preferably for in vivo use, the AAV 

rep gene may also be delivered as part of the hybrid 
virus. One embodiment of this single particle concept is 
the polycation conjugated hybrid virus (see Fig. 7). 
Infection of this trans-infection particle is 

30 accomplished in the same manner and with regard to the 
same target cells as identified above. The poly lysine 
conjugate of the hybrid virus onto which was directly 
complexed a plasmid that encoded the rep 78 and 52 
proteins, combines all of the functional components into 

35 a single particle structure. Thus, th trans-infection 
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particle permits delivery of a single particle to the 
cell, which is considerably more desirable for 
therapeutic use. Similar experiments to demonstrate 
rescue of the transgene from the hybrid conjugate trans- 
5 infection particle in 293 cells und in HeLa cells are 

detailed in Example 4. 

In another embodiment, the hybrid virus is 

modified by cloning the rep cDNA directly into the 
adenovirus genome portion of the hybrid vector. Because 
10 it is known that even residual levels of rep expression 
can interfere with replication of adenovirus DNA, such 
incorporation of rep into the hybrid vector itself is 
anticipated to requires possible mutation of the rep 
genes to encode only selected domains, and the use of 
15 inducible promoters to regulate rep expression, as well 
as careful placement of the rep genes into the adenovirus 
sequences of the hybrid vector. 

C. TransaenB Integrat e** into Chromosome 

Once uncoupled (rescued) from the genome 
20 of the hybrid virus, the recombinant AAV/transgene 

minigene seeks an integration site in the host chromatin 
and becomes integrated therein, providing stable 
expression of the accompanying transgene in the host 
cell. This aspect of the function of the hybrid virus is 
25 important for its use in gene therapy. The AAV/ 

transgene minigene sequence rescued from the hybrid virus 
achieves provirus status in the target cell, i.e., the 
final event in the hybrid lifecycle (Fig. 7) . 

To determine whether the AAV minigene 
30 rescued from the hybrid virus achieves provirus status in 
a target cell, non-El expressing HeLa cells were infected 
with th hybrid vector-poly-Lysine conjugate complexed 
with pR p78/52 [SEQ ID NO: 2] and passaged until stable 
colonies of LacZ expressing cells are evident. A 
35 duplicate plate of cells was inf cted with the sam 
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conjugate, but instead of being complexed with the 
pRep78/52 plasmid [SEQ ID NO: 2], carried an irrelevant 
plasmid. cells that receive the rep containing hybrid 
particle produced a greater number of stable LacZ 
5 positive colonies than cells injected with the control 
vector. This indicates multiple rescue and integration 
events in cells that expressed rep proteins. 
Confirmation of integration is revealed by characterizing 
the recombinant AAV genome in the hybrid infected cells 
10 and identifying flanking chromosomal sequences (see 
Example 5) . 

III. Use of the Hybrid Viruses and Viral Particles 

in oene Therapy 

15 Tne novel hybrid virus and trans-infection 

particles of this invention provide efficient gene 
transfer vehicles for somatic gene therapy. These hybrid 
viruses are prepared to contain a therapeutic gene in 
place of the .LacZ reporter transgene illustrated in the 

20 exemplary vector. By use of the hybrid viruses and 
trans-infection particles containing therapeutic 
transgenes, these transgenes can be delivered to a 
patient in vivo or ex vivo to provide for integration of 
the desired gene into a target cell. Thus, these hybrid 

25 viruses and trans-infection particles can be employed to 
correct genetic deficiencies or defects. Two examples of 
the generation of gene transfer vehicles for the 
treatment of cystic fibrosis and familial 
hypercholesterolemia are described in Examples 6 and 7 

30 below. One of skill in the art can generate any number 
of other gene transfer vehicles by including a selected 
transgene for the treatment of other disorders. For 
example, the trans- infection particles are anticipated to 
be particularly advantageous in ex vivo gene therapy 
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where transduction and proviral integration in a stem 
cell is desired, such as in bone marrow directed gene 
therapy. 

The hybrid viruses and trans-infection 
5 particles of the present invention may be administered to 
a patient, preferably suspended in a biologically 
compatible solution or pharmaceutically acceptable 
delivery vehicle. A suitable vehicle includes sterile 
saline. Other aqueous and non-aqueous isotonic sterile 
10 injection solutions and aqueous and non-aqueous sterile 
suspensions known to be pharmaceutically acceptable 
carriers and well known to those of skill in the art may 
be employed for this purpose. 

The hybrid viruses and trans-infection 
15 particles of this invention may be administered in 

sufficient amounts to transfect the desired cells and 
provide sufficient levels of integration and expression 
of the selected transgene to provide a therapeutic 
benefit without undue adverse or with medically 
20 acceptable physiological effects which can be determined 
by those skilled in the medical arts. Conventional and 
pharmaceutically acceptable routes of administration 
include direct delivery to the target organ, tissue or 
site, intranasal, intravenous, intramuscular, 
25 subcutaneous, intradermal, oral and other parental routes 
of administration. Routes of administration may be 
combined, if desired. 

Dosages of the hybrid virus and/ or trans- 
infection particle will depend primarily on factors such 
30 as the condition being treated, the selected gene, the 

age, weight and health of the patient, and may thus vary 
among patients. A th rap utically effective human dos 
of the hybrid viruses or trans-inf ction particles of the 
present inv ntion is believ d to be in the range of from 
35 about 20 to about 50 ml of saline solution containing 
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concentrations of from about 1 x 10 7 to 1 x 10 10 pfu/ml 
hybrid virus of the present invention. A preferred human 
dose is about 20 ml saline solution at the above 
concentrations. The dosage will be adjusted to balance 
the therapeutic benefit against any side effects. The 
levels of expression of the selected gene can be 
monitored to determine the selection, adjustment or 
frequency of dosage administration. 



IV. High Efficiency Production of rAAV 

The hybrid viruses and trans-infection 
particles of this invention have another desirable 
utility in the production of large quantities of 
recombinant AAV particles. Due to the complicated 
15 current methods for generating AAV, there is only a 

limited amount of AAV available for use in industrial, 
medical and academic biotechnology procedures. The 
vectors and viruses of the present invention provide a 
convenient and efficient method for generating large 
quantities of rAAV particles. 

According to this aspect of the invention, a 
trans-infection particle is constructed as described 
above and in Example 3 and is employed to produce high 
levels of rAAV as detailed in Example 8, with the 
possible modifications described in Example 9 below. 
Briefly, a plasmid is generated that contains both AAV 
rep and cap genes under the control of a suitable plasmid 
and is complexed to the poly-iysine exterior of the 
hybrid virus as described above. This trans- infection 
particle is then permitted to infect a selected host 
cell, such as 293 cells. The presence of both rep and 
cap permit th formati n of AAV particles in the cells 
and generat an AAV virus titer of about 10* virions. In 
contrast, current meth ds involving the transfection of 
multiple plasmids produce only about 10 7 virion tit r. 
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The rAAV is isolated from the culture by selecting the 
LacZ-containing blue plagues and purifying them on a 
cesium chloride gradient. 

The benefit of this procedure relates to the 
5 fact that the cis AAV element is encoded by the parental 
adenovirus genome. As a result, the trans plasmid is the 
only DNA component that is needed for complex formation. 
The cell is thereby loaded with significantly more copies 
of the trans-acting rep and cap sequences, resulting in 

10 improved efficiency of rescue and packaging. 

Numerous comparative studies focusing on the 
optimal ratio and copy number of the cis and trans 
plasmids for AAV production indicated that there is a 
positive correlation between the trans plasmid copy 

15 number and yield of recombinant virus. As described in 
detail in Example 8, the yield of recombinant AV.CMVLacZ 
virus was increased by 5-10 fold by using the trans- 
infection particle instead of a standard adenovirus 
vector. 

20 The primary limitation associated with the 

production of recombinant AAV using a hybrid virus of 
this invention relates to difficulties that arise in 
distinguishing between the two viruses (i.e., adenovirus 
and AAV) that are produced by the cell. Using the 

25 exemplary vectors and vector components of this 

invention, LacZ histochemical staining could not be used 
to titer the yield of recombinant AV.CMViacZ since any 
contaminating Ad.AV.CMVLacZ hybrid would contribute to 
the final count. Therefore, a rapid Southern blot 

30 technique for quantitating yields of recombinant AAV was 
incorporated. The assay that was developed enabled not 
only quantitation and verificati n of AAV production, but 
also demonstrated the r moval f contaminating hybrid 
virus from purified AAV stocks. 
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Another method for detecting contaminating 
hybrid virions involves modifying the hybrid vector by 
inserting a small second reporter minigene (i.e., 
reporter gene, promoter and other expression control 
5 sequences, where desired) into vhe E3 region of the 

parental adenovirus backbone. Because this reporter is 
not linked to the AAV domain, contaminating hybrid virus 
that is present during purification can be monitored by 
this hybrid-specific marker. Another possible reporter 

10 gene is the nucleic acid sequence for green fluorescent 
protein. With this hybrid vector containing two reporter 
sequences, histochemical staining for alkaline 
phosphatase (adenovirus reporter) or p-galactosidase (AAV 
reporter) activity can be used to monitor each viral 

15 domain. 

The following examples illustrate the 
construction and testing of the hybrid vectors of the 
present invention and the use thereof in the productions 
of recombinant AAV. These examples are illustrative 
20 only, and do not limit the scope of the present 
invention. 

Rvample l - Con struction of a Hybrid Virus 

A first hybrid adenovirus-AAV virus was 

25 engineered by homologous recombination between DNA 

extracted from an adenovirus and a complementing vector 
according to protocols previously described [see, e.g., 
K. F. Kozarsky et al, J. Biol, chem. . 2SSL'- 13695-13702 
(1994) and references cited therein]. The following 

30 description refers to the diagram of Fig. 1. 

Adenovirus DNA was extracted from CsCl purified 
dl7001 virions, an Ad5 (serotype subgroup C) variant that 
carries a 3 kb deletion between mu 78.4 through 86 in the 
nonessential E3 r gion (provided by Dr. William Wold, 

35 Washington University, St. Louis, Missouri) . Adenoviral 
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DNA was prepared for co-transfection by digestion with 
Clal (adenovirus genomic bp position 917) which removes 
the left arm of the genome encompassing adenovirus map 
units 0-2.5. See lower diagram of Fig. IB. 
5 The complementing hybrxd vector, pAd.AV.CMVLacZ 

(see Fig. 1A and Fig. 2 [SEQ ID NO: 1]) was constructed 
as follows: 

A parental cloning vector, pAd.Bglll was 
designed. It contains two segments of wild-type Ad5 

10 genome (i.e., map units 0-1 and 9-16.1) separated by a 
unique Bglll cloning site for insertion of heterologous 
sequences. The missing Ad5 sequences between the two 
domains (adenovirus genome bp 361-3327) results in the 
deletion of Ela and the majority of Elb following 

15 recombination with viral DNA. 

A recombinant AAV genome (AV.CMVLacZ) was 
designed and inserted into the Bglll site of pAd.Bglll to 
generate the complementing plasmid. The linear 
arrangement of AV.CMVLacZ [SEQ ID NO: 1] (see top diagram 

20 of Fig. IB) includes: 

(a) the 5' AAV ITR (bp 1-173) obtained by PCR 
using pAV2 [C. A. Laughlin et al, Gene, 22'* 65-73 (1983)] 
as template [nucleotide numbers 365-538 of Fig. 2 [SEQ ID 
NO: l]]; 

25 (b) a CMV immediate early enhancer /promoter 

[Boshart et al, Sell, 11:521-530 (1985); nucleotide 
numbers 563-1157 of Fig. 2 [SEQ ID NO: 1]], 

(c) an SV40 splice donor-splice acceptor 
(nucleotide numbers 1178-1179 of Fig. 2 [SEQ ID NO: 1]), 

30 (d) E. coli beta-galactosidase cDNA 

(nucleotide numbers 1356 - 4827 of Fig. 2 [SEQ ID NO: 

H), 

(e) an SV40 polyadenylation signal (a 237 Bam 
HI-BclI restriction fragment containing the 
35 cleavage/poly-A signals from both the early and late 
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transcription units; nucleotide numbers 4839 - 5037 of 
Fig. 2 [SEQ ID NO: 1]) and 

(f) 3«AAV ITR, obtained from pAV2 as a SnaBI- 
Bglll fragment (nucleotide numbers 5053 - 5221 of Pig. 2 

5 [SEQ ID NO: 1]) . 

The resulting complementing hybrid vector, 
pAd.AV.CMVXacZ (see Fig. 1A and Fig. 2 [SEQ ID NO: 1]), 
contained a single copy of recombinant AV.ONLacZ flanked 
by adenovirus coordinates 0-1 on one side and 9-16.1 on 

10 the other. Plasmid DNA was linearized using a unique 

Nhel site immediately 5' to adenovirus map unit zero (0) 
(resulting in the top diagram of Fig. IB) . 

Both the adenovirus substrate and the 
complementing vector DNAs were transfected to 293 cells 

15 [ATCC CRL1573] using a standard calcium phosphate 

transfection procedure [see, e.g., Sambrook et al, cited 
above] . The end result of homologous recombination 
involving sequences that map to adenovirus map units 9- 
16.1 is hybrid Ad.AV.CMVLacZ (see Fig. 1C) in which the 

20 Ela and Elb coding regions from the dl7001 adenovirus 
substrate are replaced with the AV.CKV.LacZ from the 

hybrid vector. 

Twenty-four hours later, the transfection 
cocktail was removed and the cells overlayed with 0.8% 

25 agarose containing lx BME and 2% fetal bovine serum 
(FBS) . Once viral plaques developed (typically 10-12 
days post-transf ection) , plaques were initially screened 
for E. coli p-galactosidase (LacZ) activity by overlaying 
the infected monolayer with agarose supplemented with a 

30 histochemical stain for LacZ, according to the procedure 
described in J. Price et al, Proc. Natl. Acad. Pel,, USA , 
M: 156-160 (1987). Positive cl nes (identified by the 
deposit of insoluble blue dye) were isolated, subjected 
to three rounds of fre ze (dry ice/ethanol)- thaw (37»C) 

35 and an aliquot f the suspended plaque was used to infect 
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a fresh monolayer of 293 cells seeded on duplicate 60mm 
plates. 

Twenty-four hours later the cells from one set 
of plates were fixed and again stained for LacZ activity. 
5 Cells from the duplicate plate w*.re harvested, suspended 
in 0.5 ml 10 mM Tris-Cl f pH8.0, and lysed by performing a 
series of three freeze (dry ice/ethanol)-thaw (37 °C) 
cycles. Cell debris was removed by centrifugation and an 
aliquot of the supernatant used to measure LacZ enzyme 
10 activity. 

As indicated in Fig. 3, assays for p- 
galactosidase activity which measured the absorbance at 
420 nm of the beta-galactosidase blue color in successful 
recombinants, revealed that eight of the ten isolated, 
15 putative positive clones (D1A through D1J) expressed high 
levels of enzyme. Histochemical staining produced 

similar results. 

Large-scale production and purification of 
recombinant virus was performed as described in Kozarsky 
20 et al, cited above, and references cited therein. 

fy^nlP 2 - Fu nctional Analysis of Hybrid Vector 

The ability to rescue the AV.CMVLacZ sequence 
[SEQ ID NO: 1] from the hybrid virus represented an 

25 important feature of the hybrid vector and virus systems 
of Example 1. To evaluate this feature, it was necessary 
to provide the necessary AAV gene products in trans that 
direct AAV excision and amplification (i.e. rep 
proteins) . Furthermore, this experiment was conducted in 

30 293 cells to transcomplement the El deletion in the 
Ad.AV.CMVLacZ clones, because the adenovirus El gene 
proteins have been shown to be important for initiating 
the lytic phase of the AAV lif cycle. 

293 cells were seeded onto 6-w 11 35 mm plates 

35 at a density of 1 x 10 6 cells/well. Twenty-four hours 
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later, seeding media [DMEM/10% FBS supplemented with 
antibiotics] was replaced with 1.0 ml DMEM/2% FBS and 
infected with Ad.AV.CMV.LacZ hybrid clones at an MOI of 1. 
Two hours later, each well was transfected with 1 ng 
5 plasmid pRep78/52 [SEQ ID NO: 2j, a trans-acting plasmid 
that encodes the sequence encoding the AAV rep 78 kD and 
52 kD proteins. The rep sequences in this construct are 
under the control of the AAV P5 promoter and utilize an 
SV40 polyadenylation signal. 

10 as a positive control for AAV rescue, 293 cells 

seeded in a 6-well plate as above were co-transfected 
with a cis-acting AAV plasmid pAV.CMVLacS and pRep78/52. 
pAV.CMVLacZ contained AV.CMVLacZ, the identical sequence 
encoded by pAd.AV.CMVLacZ [SEQ ID NO: 1] described in 

15 Example 1 cloned into the Bglll site of pSP72 (Promega) . 

To provide the necessary adenovirus helper 
function for AAV rescue, cells were infected with either 
wild-type Ad5 virus or a first generation El-deleted 
virus Ad.CMhpAP at an MOI of 5, approximately 2 hours 

20 prior to adding the transfection cocktail. Ad.CMhpAP is 
identical to Ad.CMVLacZ (Example 1) with the modification 
that the alkaline phosphatase sequence (which can be 
obtained from Genbank) is inserted in place of the LacZ 
gene. 

25 Transfections were performed with Lipofectamine 

(Life Technologies) according to the instructions 
provided by the manufacturer. Thirty hours post- 
transfection, the cells were harvested and episomal DNA 
(Hirt extract) prepared as described by J. M. Wilson et 

30 al, , t mm. Chem. . 2SH (16) : 11483-11489 (1992). Samples 
were resolved on a 1.2% agarose gel and electroblotted 
onto a nylon membrane. Blots w re hybridized (Southern) 
with a 32 P random prim r-label d r striction fragment 
isolated from the E. coli LacZ cDNA. 
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The full spectrum of duplex molecular species 
that appear during a lytic AAV infection (i.e., monomer ic 
forms of the double stranded intermediates, RFm and RFd, 
respectively) were evident in transfected cells infected 
5 with wild type and El deleted Ad^. No replicative 
intermediates were detected when transf ections were 
performed in the absence of helper virus. 

Hirt extracts from the 293 cells infected with 
putative Ad.AV.CMVLacZ hybrid clones D1A and D1C revealed 

10 a single band corresponding to the viral DNA, when probed 
with a LacZ restriction fragment. In the presence of rep 
proteins 78 and 52, however, the same clones yielded a 
banding pattern that included not only the adenovirus 
DNA, but an RF monomer and dimer of AV.CMVLacZ. A 

15 single-stranded form of AV.CMVLacZ [SEQ ID NO: 1] was not 
evident. Two additional clones gave similar banding 
patterns, DIB and D1H. In all, each of the eight 
Ad •AV.CMVLacZ hybrids that were found in Fig. 3 to 
express high levels of Lac Z activity were positive for 

20 rescue of the AAV domain. 

With the exception of an extra band of 
approximately 3.5 kb, the rescue of the AV.CMVLacZ [SEQ 
ID NO: l] from the hybrid viral DNA was nearly identical 
to results obtained from a standard cis and trans 

25 plasmid-based approach. In these later samples, 

adenovirus helper function was provided by pre- infecting 
cells with either wild-type Ad5 or an El-deleted 
recombinant virus Ad.CBhpAP (also termed H5.CBALP). The 
Ad.CBhpAP virus has the same sequence as the Ad • CMhpAP 

30 virus described above, except that the CMV promoter 

sequence is replaced by the chicken cytoplasmic B-actin 
promoter [nucleotides +1 to +275 as described in T. A. 
Kost et al, Nucl. Acids Res. . H(23):8287 (1983)]. The 
level of rescue in c lis infected with WT Ad5 appeared to 

35 be greater relative to those infected with the 
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recombinant Ad.CEhpAP virus, likely due to the additional 
El expression provided by the wild-type genome. The 
relevance of including an El deleted adenovirus here is 
to document that the level of adenovirus El proteins 
5 expressed in 293 cells is sufficient for AAV helper 
function. 

Ky»m P i<> a - syn thesis of Polvlvsine Conjugates 

Another version of the viral particle of this 

10 invention is a polylysine conjugate with a rep plasmid 
complexed directly to the hybrid virus caps id. This 
conjugate permits efficient delivery of the rep 
expression plasmid pRep78/52 [SEQ ID NO: 2 J in tandem 
with the hybrid virus, thereby removing the need for a 

15 separate transfection step. See, Fig. 8 for a 
diagrammatic outline of this construction. 

Purified stocks of a large-scale expansion of 
Ad.AV.CMVLacZ clone D1A were modified by coupling poly-L- 
lysine to the virion capsid essentially as described by 

20 K. J. Fisher and J. M. Wilson, Biochem. J. . 49-58 
(1994), resulting in an Ad.AV.CMVLacZ-(Lys) n conjugate. 
The procedure involves three steps. First, hybrid 
virions are activated through primary amines on capsid 
proteins with the heterobifunctional water-soluble cross- 

25 linking agent, sulpho-SMCC (sulpho-(N-succinimidyl 4-(N- 
maleimidomethyl)-cyclohexane-l-carboxylate] (Pierce) . 
The conjugation reaction, which contained 0.5 mg (375 
nmol) of sulpho-SMCC and 6 x 10 12 A 260 hybrid vector 
particles in 3.0 ml of HBS, was incubated at 30°C for 45 

30 minutes with constant gentle shaking. This step involved 
formation of a peptide bond between the active N- 
hydroxysuccinimide (NHS) ester of sulpho-SMCC and a free 
amine (e.g. lysine) contributed by an adenovirus protein 
sequence (capsid prot in) in th recombinant virus, 

35 yielding a maleimide-activated viral particle. 
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Unincorporated , unreacted cross-linker was 
removed by gel filtration on a 1 cm x 15 cm Bio-Gel P-6DG 
(Bio-Rad Laboratories) column equilibrated with 50 mM 
Tris/HCl buffer, pH 7.0, and 150 mM HaCl. Peak A 260 
5 fractions containing maleimide-accivated hybrid virus 
were combined and placed on ice. 

Second, poly-L- lysine having a molecular mass 
of 58 kDa at 10 mg/ml in 50 mM triethanolamine buffer (pH 
8.0), 150 mM NaCl and 1 mM EDTA was thiolated with 2- 

10 imminothiolane/HCl (Traut f s Reagent; Pierce) to a molar 
ratio of 2 moles-SH/mole polylysine under N 2 ; the cyclic 
thioimidate reacts with the poly (L- lysine) primary amines 
resulting in a thiolated polycation. After a 45 minute 
incubation at room temperature the reaction was applied 

15 to a 1 cm x 15 cm Bio-Gel P6DG column equilibrated with 

50 mM Tris/HCl buffer (pH 7.0), 150 mM NaCl and 2 mM EDTA 
to remove unincorporated Traut^s Reagent. 

Quantification of free thiol groups was 
accomplished with Ellman's reagent [5,5 '-dithio-bis-(2- 

20 nitrobenzoic acid)], revealing approximately 2 mol of - 
SH/mol of poly (L- lysine) . The coupling reaction was 
initiated by adding 1 x 10 12 A 260 particles of maleimide- 
activated hybrid virus/mg of thiolated poly (L- lysine) and 
incubating the mixture on ice at 4°C for 15 hours under 

25 argon. 2-mercaptoethylamine was added at the completion 
of the reaction and incubation carried out at room 
temperature for 20 minutes to block unreacted maleimide 
sites. 

Virus-poly lysine conjugates, Ad.AV.CMVLacZ- 
30 (Lys) n , were purified away from unconjugated poly(L- 
lysine) by ultracentrifugation through a CsCl step 
gradient with an initial composition of equal volumes of 
1,45 g/ml (bottom step) and 1.2 g/ml (top step) CsCl in 
10 mM Tris/HCl buffer (pH 8.0). Centrifugation was at 
35 90,000 g for 2 hours at 5°C. The final product was 
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dialyzed against 20 fflM Hepes buffer (pH 7.8) containing 

150 mM NaCl (HBS) . 

Complexes of Ad.AV.CMVLacS-(Lys) n with 
P Rep78/52 plasmid DNA [SEQ ID NO: 2] were formed by 
5 adding varying quantities of Ad;.AV.CMVLacZ-(Lys) n in 50/il 
HBS to 0.5 M9 of P Rep78/52 plasmid DHA [SEQ ID NO: 2] in 
50f»l HBS. After 30 minutes incubation at room 
temperature, a complex was formed of the hybrid virus 
Ad.AV.CMVLac*-(Lys) n associated in a single particle with 

10 the plasmid DNA containing the rep genes. 

This complex, termed a trans- infection 
particle, was evaluated for DNA binding capacity by gel 
mobility shift assays performed as described in Fisher et 
al, cited above. This analysis revealed that the plasmid 

15 binding capacity of the purified conjugate (expressed as 
the number of A 260 particles Ad.AV.CMVLacZ-(Lys) n that 
can neutralize the charge contributed by 1 jig plasmid 
DNA) was 1 ftg pRep78/52 plasmid DNA/6.0 x 10 10 A 260 
particles Ad.AV.CMVLacZ-(Lys) n . 

20 

TMfllffi rt* * - Tr a »«-Tnf action Protocol to Demonstrate AAV 

Ytfn.inian and Amplification 

Trans-infection complexes were prepared by 
mixing Ad.AV.CMVLacZ-(Lys) n conjugate with pRep78/52 

25 plasmid [SEQ ID NO: 2] and applied to 293 cells as 

follows. Ad.AV.CMVLac*-(Lys) n (6 x 10 10 A 260 particles) in 
100 nl DMEM was added dropwise to a microfuge tube 
containing 1 ng plasmid DNA in 100 /il DMEM. The mixture 
was gently mixed and allowed to incubate at room 

30 temperature for 10-15 minutes. The trans-infection 

cocktail was added to 293 cells seeded in a 35 mm 6-well 
as detailed above. Thirty hours later, cells were 
harvested and Hirt extracts prepared. 
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Samples were resolved on a 1.2% agarose gel and 
electroblotted onto a nylon membrane. Blots were 
hybridized (Southern) with a P-32 random primer-labeled 
restriction fragment isolated from the E. coll LacZ cDNA. 
5 The Hirt extracts from 293 cells revealed a 

banding pattern that suggested the AV.GMVLacZ minigene 
sequence [SEQ ID NO: 1] was efficiently rescued from the 
hybrid conjugate. Both an RF monomer and dimer of the 
recombinant AV.GMVLacZ sequence were evident. As was 

10 observed previously, the rescue event was dependent on 
rep proteins since 293 cells that were trans-infected 
with a hybrid conjugate complexed with an irrelevant 
reporter plasmid expressing alkaline phosphatase (i.e. 
pCMVhpAP) revealed only Ad.AV.CMVIracZ DNA. This negative 

15 control for rescue was secondarily useful for 

demonstrating the high efficiency of gene transfer to 293 
cells that was achieved with the conjugate vehicle. 

A duplicate set of 293 cells that received 
hybrid conjugate which was further complexed with 

20 alkaline phosphatase expression plasmid were fixed 24 

hours after addition of the trans-infection cocktail and 
histochemically stained for LacZ as described in Price et 
al, cited above, or for alkaline phosphatase activity as 
described in J. H. Schreiber et al, BioTechnioues . 

25 11:818-823 (1993). Here LacZ was a marker for the 

Ad.AV.CMVLacZ hybrid, while alkaline phosphatase served 
as a reporter for the carrier plasmid. Greater than 90% 
of the monolayer was transduced with both 0-galactosidase 
and alkaline phosphatase transgenes, showing the high 

30 efficiency of the conjugate delivery vehicle 

(differential staining revealed a blue color for the 
hybrids containing the LacZ marker and a purple color for 
the plasmids bearing th AP marker) • 

Because of the important role El proteins have 

35 for progression of the AAV lifecycle, it was critical to 
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test the efficiency of the hybrid delivery system in a 
setting where El proteins are not expressed. A 
trans-infection experiment using the hybrid conjugate 
complexed with pRep78/S2 [SEQ ID MO: 2] was therefore 
5 conducted in HeLa cells [ATCC CLL2] to remove the 
involvement of El proteins. The findings suggested 
rescue of AV.CMVLacZ occurred evidenced by the 
accumulation of RF monomers and dimers. Rescue from HeLa 
cells (which unlike the 293 cells do not contain any 

10 adenovirus El proteins) revealed lower levels of rescue 
of the transgene. The expression of rep from the AAV P5 
promoter is upregulated by adenovirus El and signals the 
beginning of the AAV lytic cycle. In the absence of El, 
rep expression from the P5 promoter is virtually silent 

15 which is important for maintenance of the proviral latent 
stages of the AAV lifecycle. It is anticipated that a 
promoter not dependent on El expression will upon 
substitution for P5, overcome this problem. 

20 Example 5 - Integration of the Transgene 

A preliminary study has been performed to 
determine whether the AAV sequence rescued from the 
hybrid virus can achieve provirus status in a target cell 
(Fig. 7). Briefly, HeLa cells [ATCC CCL 2] were infected 

25 with the hybrid conjugate complexed with pRep78/52 [SEQ 
ID NO: 2] and passaged until stable colonies of LacZ 
expressing cells were evident. A duplicate plate of 
cells was infected with the sarnie conjugate, but instead 
of being complexed with the pRep78/52 plasmid [SEQ ID NO: 

30 2], carried an irrelevant plasmid. These findings 

indicated that cells that received the Rep containing 
hybrid particle produced a greater number of stable LacZ 
positiv colonies than cells that were infected with the 
control virus. This could be interpreted as a reflection 

35 of multiple rescue and integrati n events in cells that 
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expressed Rep proteins. However, it is possible that an 
episomal form of AAV that can persist for extended 
periods of time was present. 

To establish the occurrence of integration into 
5 the chromosome of the minigene Av.CMVLacZ from the hybrid 
conjugate, the following experiment is performed. The 
Ad.AV.CHVLacZ-(Lys) n conjugate carrying pRep78/52 plasmid 
[SEQ ID NO: 2] is used to infect HeLa cells [ATCC CRL2] 
(primary fibroblasts may also be used) . The infected 

10 cells are passaged for several generations. The cells 
are grown to confluency, split and allowed to grow to 
confluency again, split again and this cycle repeated as 
desired. This permits sufficient time for uptake, 
expression, replication and integration to occur. See 

15 Fig. 7. 

To verify that the recombinant AAV sequence 
that was rescued from the hybrid genome (step III of Fig. 
7) has integrated into a chromosome of the host cell 
(step IV of Fig. 7) , cells are separated by a 

20 Fluorescence Activated Cell Sorter (FACS) . By this 
technique, those cells containing a stable integrated 
copy of the recombinant AV.CMVLacZ minigene are separated 
based on the presence of the 0-galactosidase reporter. 
These cells are tagged with fluorescein-labeled 

25 antibodies that recognize the p-Gal protein, and are then 
separated from non-transduced cells (i.e. those that did 
not receive a copy of the AAV minigene) by FACS. 

DNA is isolated from this purified population 
of cells and used to construct a genomic library which is 

30 screened for individual clones and the sequence verified. 
If integration occurs, it is documented directly by 
sequence analysis. 
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T^mnrl- * - **** Tranter vehicle for fYPtlrr Fibrosis 

An adenovirus-AAV-CFTR virus constructed by 
modifying the hybrid Ad.AV.CMVIacZ virus described in 
Example 1 to contain the cystic fibrosis transmembrane 
regulator (CFTR) gene [J.R. Rioidan et al. Science, , 
245:1066-1073 (1989)] in place of the lacZ gene, using 
known techniques. One suitable method involves producing 
a new vector using the techniques described in Example 1. 
in this new vector the LacZ minigene is replaced with the 
CFTR minigene. For performance of this method vectors 
bearing the CFTR gene have been previously described and 
can be readily constructed. This new or reconstructed 
vector is used to generate a new virus through homologous 
recombination as described above. The resulting hybrid 
15 virus is termed hybrid Ad.AV.CMVCFTR. It has the 

sequence of Fig. 2 [SEQ ID NO: 1], except that the LacZ 
gene is replaced with CFTR. Alternatively, the LacZ gene 
can be removed from the Ad.AV.CHVJacZ vector of Example 1 
and replaced with the CFTR gene using known techniques. 
20 This virus (or an analogous hybrid virus with a 

different promoter, regulatory regions, etc.) is useful 
in gene therapy alone, or preferably, in the form of a 
conjugate prepared as described in Example 4. 

Treatment of cystic fibrosis, utilizing the 
25 viruses provided above, is particularly suited for in 
vivo, lung-directed, gene therapy. Airway epithelial 
cells are the most desirable targets for gene transfer 
because the pulmonary complications of CF are usually its 
most morbid and life-limiting. Thus, the hybrid vector 
of the invention, containing the CFTR gene, is delivered 
directly into the airway, e.g. by formulating the hybrid 
virus above, into a preparation which can be inhaled. 
For xample, the hybrid virus or c n jugate of the 
inv ntion containing th CFTR gen , is suspended in 0.25 
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molar sodium chloride. The virus or conjugate is taken 
up by respiratory airway cells and the gene is expressed. 

Alternatively, the hybrid viruses or conjugates 
of the invention may be delivered by other suitable 
5 means, including site-directed injection of the virus 
bearing the CFTR gene. In the case of CFTR gene 
delivery, preferred solutions for bronchial instillation 
are sterile saline solutions containing in the range of 
from about l x 10 7 to 1 x 10 10 pfu/ml, more particularly, 

10 in the range of from about 1 x 10 8 to 1 x 10 9 pfu/ml of 
the recombinant hybrid virus of the present invention. 

Other suitable methods for the treatment of 
cystic fibrosis by use of gene therapy recombinant 
viruses of this invention may be obtained from the art 

15 discussions of other types of gene therapy vehicles for 
CF. See, for example, U. S. Patent No. 5,240,846, 
incorporated by reference herein. 

fl y a mpii> 7 - Gen e Transfer Vehicle for Familial 

20 Hvpercho] estarolemia 

Familial hypercholesterolemia (FH) is an 
autosomal dominant disorder caused by abnormalities 
(deficiencies) in the function or expression of LDL 
receptors [M.S. Brown and J.L. Goldstein, Science, 

25 222(4746) : 34-37 (1986); J.L. Goldstein and M.S. Brown, 
"Familial hypercholesterolemia" in Metabolic Basis of 
TT,w^ ft d Disease. , ed. C.R. Scriver et al, McGraw Hill, 
New York, ppl215-1250 (1989).] Patients who inherit one 
abnormal allele have moderate elevations in plasma LDL 

30 and suffer premature life-threatening coronary artery 
disease (CAD) . Homozygous patients have severe 
hypercholesterolemia and life-threatening CAD in 
childhood. 
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A hybrid adenovirus-AAV-LDL virus of the 
invention is constructed by replacing the lacZ gene in 
the hybrid Ad.AV.CHVlacZ virus of Example 1 with an LDL 
receptor gene [T. Yamamoto et al, Cell, 22: 27-38 (1984)] 
5 using known techniques and as described analogously for 
CP in the preceding example. Vectors bearing the LDL 
receptor gene can be readily constructed according to 
this invention. The resulting hybrid vector is termed 
pAd . AV. OSVLDL . 

10 This plasmid or its recombinant virus is useful 

in gene therapy of FH alone, or preferably, in the form 
of a viral conjugate prepared as described in Example 4 
to substitute a normal LDL gene for the abnormal allele 
responsible for the gene. 

15 A. T!v Vivo Gene Therapy 

Ex vivo gene therapy can be performed by 
harvesting and establishing a primary culture of 
hepatocytes from a patient. Known techniques may be used 
to isolate and transduce the hepatocytes with the above 

20 vector (s) bearing the LDL receptor gene(s). For example, 
techniques of collagenase perfusion developed for rabbit 
liver can be adapted for human tissue and used in 
transduction. Following transduction, the hepatocytes 
are removed from the tissue culture plates and reinfused 

25 into the patient using known techniques, e.g. via a 
catheter placed into the inferior mesenteric vein. 
B. Tn Vivo Gene Therapy 

Desirably, the in vivo approach to gene 
therapy, e.g. liver-directed, involves the use of the 

30 hybrid viruses and viral conjugates described above. A 
preferred treatment involves infusing a trans- infect ion 
particle of the invention containing LDL into the 
peripheral circulation of the patient. The patient is 
then evaluated for change in serum lipids and liver 

35 tissues. 
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The hybrid virus or viral conjugate can be 
used to infect hepatocytes in vivo by direct injection 
into a peripheral or portal vein (10 7 -10 8 pfu/kg) or 
retrograde into the biliary tract (same dose) . This 
5 effects gene transfer into the majority of hepatocytes. 

Treatments are repeated as necessary, e.g. 
weekly. Administration of a dose of virus equivalent to 
an MOI of approximately 20 (i.e. 20 pfu/hepatocyte) is 
anticipated to lead to high level gene expression in the 
10 majority of hepatocytes. 

fiyamp-le B - Efficient Production of Recombinant AAV using 
A Hybrid Virus /Conjugate 

The following experiment demonstrated that the 

15 AAV genome that was rescued from the Ad.AV.CMVLacZ hybrid 
virus could be packaged into an AAV caps id, provided the 
cap open reading frame was supplied in trans. Thus the 
viruses of this invention are useful in a production 
method for recombinant AAV which overcomes the prior art 

20 complications that surround the high titer production of 
recombinant AAV. 

A. Trans-infection Protocol for the 

Prrtfrvr*-* 0 " of rAAV 

A trans-infection complex was formed 

25 composed of the Ad.AV.CMVLacZ-(Lys) n conjugate described 
above and a transcomplementing plasmid pAdAAV, which is 
described in detail in R. J. Samulski et al, J, Virol., 
£2(9) : 3822-3828 (1989)]. Briefly, plasmid pAdAAV encodes 
the entire rep and cap open reading frames in the absence 

30 of AAV ITRs, and has been shown to provide the necessary 
AAV helper functions for replication and packaging of 
recombinant AAV sequences. 

Ad.AV.CMVLacZ-(Lys) n conjugate (4.5 x 10 13 
A 26 o particles) in 75 ml DMEM was added dropwise with 

35 constant g ntle swirling in 25 ml DMEM containing 750 ng 
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pAdAAV helper pltsmid and incubated at room temperature 
for 10-15 minutes. The complex was diluted with 450 ml 
DMEM supplemented with 2% PBS and 20 ml aliquots were 
added to monolayers of 293 cells seeded on 150 mm plates. 
5 Forty hours post tranj-infection, cells were 

harvested, suspended in 12 ml 10 mM Tris-Cl (pH 8.0), and 

stored at -80 °C. 

Because the anticipated outcome was the 
production of hybrid virus Ad.AV.CMVLacZ and a 

10 recombinant AAV virion (AV.CMVLacZ) , both of which carry 
a functional LacZ minigene, it was not possible to use 
detection of LacZ activity as an indicator of AV.CMV.LacZ 
production. A novel molecular approach was developed 
that could be performed in one day and permitted 

15 identification of the packaged viral DNAs. 

B. Purification of rAAV 

Briefly, frozen cell suspensions were 
subjected to three rounds of freeze-thaw cycles to 
release recombinant AV.CMVLacZ and hybrid Ad.AV.CMVLacZ. 

20 On completion of the final thaw, bovine pancreatic DNAse 
(2000 units) and ribonuclease (0.2 mg/ml final 
concentration) was added and the extract incubated at 
37 °C for 30 minutes. Cell debris was removed by 
centrifugation (5000xg for 10 minutes) and the clarified 

25 supernatant (15 ml) applied to a 22.5 ml step gradient 

composed of equal volumes of CsCl at 1.2 g/ml, 1.36 g/ml, 
and 1.45 g/ml lOmM Tris-Cl, pH8.0. Viral particles were 
banded at 25,000 rpm in a Beckman sw-28 rotor for 8 hours 
at 4«c. One ml fractions were collected from the bottom 

30 of the tube. 

The fractions retrieved from the CsCl 
gradient of partially purified virus are then digested to 
release viral DHA from virion capsids as follows. A 
5.0m1 sample of each fraction was transferred to a 
35 microfuge tube containing 20 pi capsid digestion buffer 
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10 



30 



(50mM Tris-Cl, pHC.O, l.OmM EDTA, pH8,0, 0.5% SDS f and 
1.0 mg/ml Proteinase K) . The reaction was incubated at 
50°C for 1 hour, allowed to cool to room temperature, 
diluted with 10 Ml milli-Q water, and agarose gel loading 
dye added. 

These fractions are then analyzed by 
Southern blotting. Samples were resolved on a 1.2% 
agarose gel, electroblotted onto a nylon membrane. A 32 P 
labeled LacZ restriction fragment which was common to 
both vectors was used as a hybridization probe to locate 
the migration of viral DNA through the agarose gel. 
Viral bands were quantitated on a Molecular Dynamics 
Phosphoimager. 

A sample of the extract before CsCl 
15 banding was also tested and revealed both hybrid 

Ad.AV.CMVLacZ DNA and double-stranded RF forms (monomers 
and dimers) of the rescued AV.CKVLacZ sequence [SEQ ID 
NO: 1]. A single-stranded monomer of AV.CMVLacZ appeared 
to be present in the crude extract; however, it was not 
20 until the virions were concentrated by buoyant density 
ultracentrifugation that the single-stranded genome 
became clearly evident. The single-stranded recombinant 
genome of the virus was distributed over a range of CsCl 
densities and revealed a biphasic banding pattern. The 
25 two peaks of single-stranded rAAV genome occurred at 

densities of 1.41 and 1.45 g/ml CsCl, consistent with the 
reported buoyant densities of wild-type AAV in CsCl [L. 
M. de la Maza et al, J. Virol.. 33:1129-1137 (1980)]. 
Analysis of the fractions corresponding to the two vector 
forms revealed the rAAV-1.41 species was several orders 
of magnitude more active for lacZ transduction than the 
denser rAAV-1.45 g/ml variant. To avoid confusion with 
contaminating Ad. AAV, samples were heat inactivated (60°C 
for 30 min) before being added to indicator HeLa cells. 
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The peak fractions of rAAV-1.41 were 
combined and purified by equilibrium sedimentation in 
CsCl to eliminate residual adenovirus particles and 
concentrate rAAV virions. On the final round of 
5 ultracentrifugation, a faint buc clearly visible 
opalescent band was observed in the middle of the 
gradient tube. Fractions that surrounded the band were 
evaluated for density, absorbance at 260 nm, and lacZ 
transducing particles. As the band eluted from the 

10 gradient tube, a well defined peak of 260 nm absorbing 
material was recorded, with a maximal absorbance 
occurring at a density of 1.40 g/ml CsCl. Analysis for 
lacZ transducing particles on HeLa cells revealed a peak 
of activity that mirrored the absorbance profile. These 

15 results indicate rAAV was produced from the hybrid Ad. AAV 
virus. Furthermore, the titers achieved using the hybrid 
virus were 5-10 fold elevated compared to more 
conventional recombinant AAV production schemes (i.e., 
transfections with cis- and trans-acting plasmids) . This 

20 represents a significant improvement in rAAV production 
and indicates that the hybrid is useful for large-scale 
rAAV production. 

All references recited above are incorporated 
herein by reference. Numerous modifications and 

25 variations of the present invention are included in the 
above-identified specification and are expected to be 
obvious to one of skill in the art. Such modifications 
and alterations to the compositions and processes of the 
present invention, such as those modifications permitting 

30 optimal use of the hybrid viruses as gene therapy 
vehicles or production vehicles for recombinant AAV 
production, are believed to be encompass d in the scope 
of the claims appended hereto. 
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Kelley, William M. 
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(iii) NUMBER OF SEQUENCES: 2 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Howson and Howson 

(B) STREET: Spring House Corporate Cntr, PO Box 457 

(C) CITY: Spring House 

(D) STATE: Pennsylvania 

(E) COUNTRY: USA 

(F) ZIP: 19477 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
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(C) OPERATING SYSTEM: PC-DOS/MS-DOS 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10398 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 



wAAl X WwW 1A 


GCATCATCAA 


TAATATACCT 

X *TL«* X A A AW W X 


X A X X X X WW A X 


TGAAGCCAAT 

X w AA w W WAA X 


50 


ATftATAATGA 


GGGG6TGGAG 


TTTWFGAOCF 

X X 1\JAUAW4 




GTGGGAACGG 


100 




V9 X AW X A W X S3 X 


UU^VIVSAAU X w 


X UAlw X X wWA 


Aw X w X ww Www 


150 

x>*s w 


& & r* & r» A TWP A 


ao w Vj a LvniA X 


ftTfVZP A A A AC 


X VJA^U X X X X X 


wW X w X UwUvV 


200 


bblVjiAUlUi 


n/lA AflTYIAPA 

WWAAW lunWi 


aX X X X WwWwW 


wwX X X X AWwW 


/J/J a. TVJ'T" 1 WP A 

WWA X w X X W X A 


250 

*f V 


ViXAAAI 1 L\a%a 


bLb lAAvwA 


ulAAbAi x lu 




vwwAAAAv X 


inn 


GAATAAGAGG 


AAGTGAAATC 


try** » m mA ikfTwrwn 
TGAATAATTT 


TGTGTTACTC 


ATAGCGCGTA 


i cn 
JDU 


ATATTTGTCT 


s ni/'wii 
AGGGAGAxwl 


GCrrGCGCGCT 


CGCTCGCTCA 


wxGAwGUwww 


Ann 


CCGGGCAAAG 


CCCGGGCGTC 


GGGCGACCTT 


TGGTCGCCCG 


GCCTCAGTGA 


450 


GCGAGCGAGC 


GCGCAGAGAG 


GGAGTGGCGA 


ACTCCATCAC 


TAGGGGTTCC 


500 


TTGTAGTTAA 


TGATTAACCC 


GCCATGCTAC 


TTATCTACAA 


TTCGAGCTTG 


550 


CATGCCTGCA 


GGTCGTTACA 


TAACTTACGG 


TAAATGGCCC 


GCCTGGCTGA 


600 


CCGCCCAACG 


ACCCCCGCCC 


ATTGACGTCA 


ATAATGACGT 


ATGTTCCCAT 


650 


AGTAACGCCA 


ATAGGGACTT 


TCCATTGACG 


TCAATGGGTG 


GAGTATTTAC 


700 


GGTAAACTGC 


CCACTTGGCA 


GTACATCAAG 


TGTATCATAT 


GCCAAGTACG 


750 


CCCCCTATTG 


ACGTCAATGA 


CGGTAAATGG 


CCCGCCTGGC 


ATTATGCCCA 


800 


GTACATGACC 


TTATGGGACT 


TTCCTACTTG 


GCAGTACATC 


TACGTATTAG 


850 


TCATCGCTAT 


TACCATGGTG 


ATGCGGTTTT 


GGCAGTACAT 


CAATGGGCGT 


900 


GGATAGCGGT 


TTGACTCACG 


GGGATTTCCA 


AGTCTCCACC 


CCATTGACGT 


950 


CAATGGGAGT 


TTGTTTTGGC 


ACCAAAATCA 


ACGGGACTTT 


CCAAAATGTC 


1000 



WO 96/13598 



PCT/US95/14018 



50 

GTAACAACTC CGCCCCATTG ACGCAAATGG 
GAGGTCTATA TAAGCAGAGC TCGTTTAGTG 
ACGCCATCCA CGCTGTTTTG ACCTCCATAG 
GCCTCCGGAC TCTAGAGGAT CCGGTACTCG 
GTTAACTGGT AAGTTTAGTC TTTTTGTCTT 
GGTGGTGGTG CAAATCAAAG AACTGCTCCT 
TCTAGGCCTG TACGGAAGTG TTACTTCTGC 
ACCCGCGGCC GCAATTCCCG GGGATCGAAA 
GAAGTCACCA TGTCGTTTAC TTTGACCAAC 
CGGTCTGGGA GGCATTGGTC TGGACACCAG 
ATCCCGTCGT TTTACAACGT CGTGACTGGG 
CTTAATCGCC TTGCAGCACA TCCCCCTTTC 
AGAGGCCCGC ACCGATCGCC CTTCCCAACA 
AATGGCGCTT TGCCTGGTTT CCGGCACCAG 
CTGGAGTGCG ATCTTCCTGA GGCCGATACT 
GCAGATGCAC GGTTACGATG CGCCCATCTA 
TTACGGTCAA TCCGCCGTTT GTTCCCACGG 
TCGCTCACAT TTAATGTTGA TGAAAGCTGG 
AATTATTTTT GATGGCGTTA ACTCGGCGTT 
GCTGGGTCGG TTACGGCCAG GACAGTCGTT 
AGCGCATTTT TACGCGCCGG AGAAAACCGC 
TTGGAGTGAC GGCAGTTATC TGGAAGATCA 
GCATTTTCCG TGACGTCTCG TTGCTGCATA 
GATTTCCATG TTGCCACTCG CTTTAATGAT 
GGAGGCTGAA GTTCAGATGT GCGGCGAGTT 
CAGTTTCTTT ATGGCAGGGT GAAACGCAGG 



GCGGTAGGCG 


Iav 1 AlAH* IavO 


1050 


AACCGTCAGA 




1 1 nn 


AAGACACCGG 




1 150 


AGGAACTGAA 


AAA^UAuAAA 




TTATTTCAGG 


TCCCGGATCC 




GAGTGGATGT 


TGCCTTTACT 


i inn 


TCTAAAAGCT 


GCGGAATTGT 


J. JDU 


GAGCCTGCTA 


AAGCAAAAAA 


i a nn 
14UO 


AAGAACGTGA 


TTTTCGTTGC 




CAAGGAGCTG 


CTCAAGCGCG 


15U0 


AAAACCCTGG 


CGTTACCCAA 


1550 


GCCAGCTGGC 


GTAATAGCGA 


looo 


GTTGCGCAGC 


CTGAATGGCG 


i £Rn 


AAGCGGTGCC 


GGAAAGCTGG 


i ion 
17O0 


GTCGTCGTCC 


CCTCAAACTG 


1 *t Kn 
1 / DO 


CACCAACGTA 


ACCTATCCCA 


1800 


AGAATCCGAC 


GGGTTGTTAC 


1850 


CTACAGGAAG 


GCCAGACGCG 


* Ann 
1900 


TCATCTGTGG 


TGCAACGGGC 


1950 


TGCCGTCTGA 


ATTT6ACCT6 


2000 


CTCGCGGTGA 


TGGTGCTGCG 


2050 


GGATATGTGG 


CGGATGAGCG 


2100 


AACCGACTAC 


ACAAATCAGC 


2150 


GATTTCAGCC 


GCGCTGTACT 


2200 


GCGTGACTAC 


CTACGGGTAA 


2250 


TCGCCAGCGG 


CACCGCGCCT 


2300 
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TTCGGCGGTG AAATTATCGA TGAGCGTGGT 
ACTACGTCTG AACGTCGAAA ACCOGAAACT 
ATCTCTATCG TGCGGTGGTT GAACTGCACA 
GAAGCAGAAG CCTGCGATGT CGGTTTCCGC 
TCTGCTGCTG CTGAACGGCA AGCCGTTGCT 
ACGAGCATCA TCCTCTGCAT GGTCAGGTCA 
CAGGATATCC TGCTGATGAA GCAGAACAAC 
GCATTATCCG AACCATCCGC TGTGGTACAC 
TGTATGTGGT GGATGAAGCC AATATTGAAA 
AATCGTCTGA CCGATGATCC GCGCTGGCTA 
AACGCGAATG GTGCAGCGCG ATCGTAATCA 
CGCTGGGGAA TGAATCAGGC CACGGCGCTA 
TGGATCAAAT CTGTCGATCC TTCCCGCCCG 
AGCCGACACC ACGGCCACCG ATATTATTTG 
ATGAAGACCA GCCCTTCCCG GCTGTGCCGA 
CTTTCGCTAC CTGGAGAGAC GCGCCCGCTG 
CGCGATGGGT AACAGTCTTG GCGGTTTCGC 
GTCAGTATCC CCGTTTACAG GGCGGCTTCG 
TCGCTGATTA AATATGATGA AAACGGCAAC 
TGATTTTGGC GATACGCCGA ACGATCGCCA 
TCTTTGCCGA CCGCACGCCG CATCCAGCGC 
CAGCAGTTTT TCCAGTTCCG TTTATCCGGG 
CGAATACCTG TTCCGTCATA GCGATAACGA 
CGCTGGATGG TAAGCCGCTG GCAAGCGGTG 
CCACAAGGTA AACAGTTGAT TGAACTGCCT 
CGCCGGGCAA CTCTGGCTCA CAGTACGCGT 



Uull A iVI v wU A X VUvU X W»AW 


2350 


w X VltlAwvULU wAAA x uv \* wa 


2400 


v>Uw\^vwA^VJ\J va\*w\**waa> x 


2450 


AHVlfVI/tA TfGAAAATGG 

Vjnb\i luLwUA X X wAAAA X WW 


2500 


WAX X VwAwwW w X X /U»v^VJ X V* 


2550 


ms*s« A rrv* it f2Af , ^ATV3fi , W2 
TGGaTuAu wi uAWVja lw x w 


2600 


TllAACbL X uuuu X w xxv 


2650 


G UlviuwAL VtfW X AvllVv** 


2700 


CCCACGuCAl uuluLUAAiu 


2750 


CCGGwviAluA ^CunAUwuvj x 


2800 


CCCGAG xa» 1 vj a X mi w xtjw x 


2850 


ATGAwGAL* wV ut Xla X A X v-wW 


2900 


GTGCAGTATG AAGGtwtWj 


2950 


CCCGATGTAC GCGCGCbiWs 


3000 


AATGGTCCAT CAAAAAA X Wj 




ATCCTTTGCG AATACGCCCA 


1 1 no 


TAAATACTGG CAGGCGTTTC 




TCTGGGACTG GGTGGATCAG 




CCGTGGTCGG CTTACGGCww 


JaJv 


GTTCTGTATG AACGGTtrXww 


•9 J V w 


TGACGGAAGC AAAACACCAG 


3350 


CAAACCATCG AAGTGACCAG 


3400 


GCTCCTGCAC TGGATGGTGG 


3450 


AAGTGCCTCT GGATGTCGCT 


3500 


GAACTACCGC AGCCGGAGAG 


3550 


AGTGCAACCG AACGCGACCG 


3600 
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CATGGTCAGA AGCCGGGCAC ATCAGCGCCT 
GAAAACCTCA GTGTGACGCT CCCCGCCGCG 
GACCACCAGC GAAATGGATT TTTGCATCGA 
AATTTAACCG CCAGTCAGGC TTTCTTTCAC 
AAACAACTGC TGAC6CCGCT GCGCGATCAG 
TAACGACATT GGCGTAAGTG AAGCGACCCG 
TCGAACGCTG GAAGGCGGCG GGCCATTACC 
CAGTGCACGG CAGATACACT TGCTGATGCG 
CGCGTGGCAG CATCAGGGGA AAACCTTATT 
GGATTGATGG TAGTGGTCAA ATGGCGATTA 
AGCGATACAC CGCATCCGGC GCGGATTGGC 
GGTAGCAGAG CGGGTAAACT GGCTCGGATT 
CCGACCGCCT TACTGCCGCC TGTTTTGACC 
GACATGTATA CCCCGTACGT CTTCCCGAGC 
GACGCGCGAA TTGAATTATG GCCCACACCA 
TCAACATCAG CCGCTACAGT CAACAGCAAC 
CATCTGCTGC ACGCGGAAGA AGGCACATGG 
TATGGGGATT GGTGGCGACG ACTCCTGGAG 
TACAGCTGAG CGCCGGTCGC TACCATTACC 
TAATAATAAC CGGGCAGGCC ATGTCTGCCC 
CATTATGTAC TATTTAAAAA ACACAAACTT 
TTTTCTTTTA CTTTTTTATC ATGGGAGCCT 
TGGCTACATG ACATCAACCA TATCAGCAAA 
TGCCGCTATT TCTCTGTTCT CGCTATTATT 
TTTCTGACAA ACTCGGCCTC GACTCTAGGC 
GATAAGATAC ATTGATGAGT TTGGACAAAC 



GGCAGCAGTG 


GCGTCTGGCG 


3650 


TCCGACGCCA 


TCCCGCATCT 


3700 


GCTGGGTAAT 


AAGCGTTGGC 


3750 


AGATcTGGAT 


TGGCGATAAA 


3800 


TTCACCCGTG 


CACCGCTGGA 


3850 


CATTGACCCT 

W«» X X W*»WW W X 


AACGCCTGGG 

Anv w W W X \*w>* 


3900 


AGGCCGAAGC 


AGCGTTGTTG 

nWw X X ^7 X X W 


3950 


GTGCTGATTA 

wX wwxwax XA 


CGACCGCTCA 

vUAv ww W X Wl 


4000 

'WWW 


TATCAGCCGG 
x ni w\uv w ww 


AAAACOTACC 

AAAAWW X AWW 


4050 

IvJV/ 


CCGTTGATGT 

Ww\J X X wAX w X 


X wAAw X ww ww 


41 00 


CTGAACTGCC 

X* X WAAW X WWW 


AGCTGGCGCA 

Aw W X WW Ww WA 


4150 


AGGGCCGCAA 

A w w w W W w WU1 


GAAAAfTATP 

wAAAAW X A X W 


4900 


wW X wwwA A W X 


wWWAX X wX WA 


4950 


GAAAACGGTC 

W«*fW%WW\7X W 


TGCGCTGCGG 

X w Ww W X vvvw 


4300 


GTGGCGCGGC 


GAr*nW*AGT 

wAWX XWWAwX 


4350 


TGATGGAAAC 

X UA X WW AAA W 


CAGCCATCGC 

WAw WWAX Www 


4400 


tiUAAlAlUi 






LLLulUtviA 


x LwLbbAA 1 


4Z>UU 


AwX AUvi w X\» 


VliUl WAAAAA 




w X AX X Ivu^u 


X AAwwAAAX W 


H wU w 


TTGGATGTTC 


GGTTTATTCT 


4650 


ACTTCCCGTT 


TTTCCCGATT 


4700 


AGTGATACGG 


GTATTATTTT 


4750 


CCAACCGCTG 


TTTGGTCTGC 


4800 


GGCCGCGGGG 


ATCCAGACAT 


4850 


CACAACTAGA 


ATGCAGTGAA 


4900 
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AAAAATGCTT TATTTGT6AA ATTTGTGATG 
ATTATAAGCT GCAATAAACA AGTTAACAAC 
GTTTCAGGTT CAGGGGGAGG TGTGGGAGGT 
CGAGTAGATA AGTAGCATGG CGGGTTAATC 
AGTGATGGAG TTGGCCACTC CCTCTCTGCG 
CCGGGCGACC AAAGGTCGCC CGACGCCCGG 
GTGAGCGAGC GAGCGCGCAG CAGATCTGGA 
ACCCGCACCA GGTGCAGACC CTGCGAGTGT 
CCAGCCTGTG ATGCTGGATG TGACCGAGGA 
TGCTGGCCTG CACCCGCGCT GAGTTTGGCT 
TGAGGTACTG AAATGTGTGG GCGTGGCTTA 
AGGTGGGGGT CTTATGTAGT TTTGTATCTG 
CCATGAGCAC CAACTCGTTT GATGGAAGCA 
ACGCGCATGC CCCCATGGGC CGGGGTGCGT 
CATTGATGGT CGCCCCGTCC TGCCCGCAAA 
AGACCGTGTC TGGAACGCCG TTGGAGACTG 
GCCGCTGCAG CCACCGCCCG CGGGATTGTG 
CCCGCTTGCA AGCAGTGCAG CTTCCOGTTC 
TGACGGCTCT TTTGGCACAA TTGGATTCTT 
GTTTCTCAGC AGCTGTTGGA TCTGCGCCAG 
TTCCTCCCCT CCCAATGCGG TTTAAAACAT 
TTTGGATTTG GATCAAGCAA GTGTCTTGCT 
CGCGCGCGGT AGGCCCGGGA CCAGCGGTCT 
TATTTTTTCC AGGACGTGGT AAAGGTGACT 
GCATAAGCCC GTCTCTGGGG TGGAGGTAGC 
TGCGGGGTGG TGTTGTAGAT GATCCAGTCG 



CTATTGCTTT 


ATTTGTAACC 


4950 


AACAATTGCA 


TTCATTTTAT 


5000 


TTTTTCGGAT 


CCTCTAGAGT 


5050 


ATTAACTACA 


AGGAACCCCT 


5100 


OGCTCGCTCG 


CTCACTGAGG 


5150 


GCTTTGCCCG 


GGCGGCCTCA 


5200 


AGGTGCTGAG 


GTACGATGAG 


5250 


GGCGGTAAAC 


ATATTAGGAA 


5300 


GCTGAGGCCC 


GATCACTTGG 


5350 


CTAGCGATGA 


AGATACAGAT 


5400 


AGGGTGGGAA 


AGAATATATA 


5450 


TTTTGCAGCA 


GCCGCCGCCG 


5500 


TTGTGAGCTC 


ATATTTGACA 


5550 


CAGAATGTGA 


TGGGCTCCAG 


5600 


CTCTACTACC 


TTGACCTACG 


5650 


CAGCCTCCGC 


CGCCGCTTCA 


5700 


ACTGACTTTG 


CTTTCCTGAG 


5750 


ATCCGCCCGC 


GATGACAAGT 


5800 


TGACCCGGGA 


ACTTAATGTC 


5850 


CAGGTTTCTG 


CCCTGAAGGC 


5900 


AAATAAAAAA 


CCAGACTCTG 


5950 


GTCTTTATTT 


AGGGGTTTTG 


6000 


CGGTCGTTGA 


GGGTCCTGTG 


6050 


CTGGATGTTC 


AGATACATGG 


6100 


ACCACTGCAG 


AGCTTCATGC 


6150 


TAGCAGGAGC 


GCTGGGCGTG 


6200 
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GTGCCTAAAA ATGTCTTTCA GTAGCAA6CT 
TGGTGTAAGT GTTTACAAAG CGGTTAAGCT 
GATATGAGAT GCATCTTGGA CTGTATTTTT 
CATATCCCTC CGGGGATTCA TGTTGTGCAG 
CGGTGCACTT GGGAAATTTG TCATGTAGCT 
AACTTGGAGA CGCCCTTGTG ACCTCCAAGA 
AATGATGGCA ATGGGCCCAC GGGCGGCGGC 
GATCACTAAC GTCATAGTTG TGTTCCAGGA 
TTTACAAAGC GCGGGCGGAG GGTGCCAGAC 
CGGCCCAGGG GCGTAGTTAC CCTCACAGAT 
GTTCAGATGG GGGGATCATG TCTACCTGCG 
TCCGGGGTAG GGGAGATCAG CTGGGAAGAA 
CGACTTACCG CAGCCGGTGG GCCCGTAAAT 
ACTGGTAGTT AAGAGAGCTG CAGCTGCCGT 
ACTTCGTTAA GCATGTCCCT GACTCGCATG 
CAGAAGGCGC TCGCCGCCCA GCGATAGGAG 
TTTTCAACGG TTTGAGACCG TCCGCCGTAG 
CCAAGCAGTT CCAGGCGGTC CCACAGCTOG 
TCGATCCAGC ATATCTCCTC GTTTCGCGGG 
CGGCAGTAGT CGGTGCTCGT CCAGACGGGC 
GGCGCAGGGT CCTCGTCAGC GTAGTCTGGG 
CCGGGCTGCG CGCTGGCCAG GGTGCGCTTG 
GAAGCGCTGC CGGTCTTCGC CCTGCGCGTC 
TGGTGTCATA GTCCAGCCCC TCCGCGGCGT 
CCCTTGGAGG AGGCGCCGCA CGAGGGGCAG 
GAGCTTGGGC GCGAGAAATA CCGATTCCGG 
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6ATT6CCAGG 


GGCAGGCCCT 


6250 


GGGATGGGTG 


CATACGTGGG 


6300 


AGGTTGGCTA 


TGTTCCCAGC 


6350 


AACCaCCAGC 


ACAGTGTATC 


6400 


TAGAAGGAAA 


TGCGTGGAAG 


6450 


TTTTCCATGC 


ATTCGTCCAT 


6500 


CTG6GC6AA6 


ATATTTCTGG 


6550 


TGAGATCGTC 


ATAGGCCATT 


6600 


TGC6GTATAA 


TGGTTCCATC 


6650 


TTGCATTTCC 


CACGCTTTGA 


6700 


6G6C6AT6AA 


GAAAACGGTT 


6750 


A6CAGGTTCC 


TGAGCAGCTG 


6800 


CACACCTATT 


ACCGGGTGCA 


6850 


CATCCCTGA6 


CAGGGGGGCC 


6900 


TTTTCCCTGA 


CCAAATCCGC 


6950 


TTCTTGCAAG 


GAAGCAAAGT 


7000 


GCATGCTTTT 


GAGCGTTTGA 


7050 


GTCACCTGCT 


CTACGGCATC 


7100 


TTGGGGCGGC 


TTTCGCTGTA 


7150 


CAGGGTCATG 


TCTTTCCACG 


7200 


TCACGGTGAA 


GGGGTGCGCT 


7250 


AGGCTGGTCC 


TGCTGGTGCT 


7300 


GGCCAGGTAG 


CATTTGACCA 


7350 


GGCCCTTGGC 


GCGCAGCTTG 


7400 


TGCAGACTTT 


TGAGGGCGTA 


7450 


GGAGTAGGCA 


TCCGCGCCGC 


7500 
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AGGCCCCGCA 6ACGGTCTCG 
TCGGGGTCAA AAACCAGGTT 
TCTGGTTTCC ATGAGCCGGT 
TGTCCCCGTA TACAGACTTG 
AGCCTTCAAC CCAGTCAGCT 
TCGCCGCACT TATGACTGTC 
CCGGCAGCGC TCTGGGTCAT 
GACGATGATC GGCCTGTCGC 
CTCAAGCCTT CGTCACTGGT 
GCCATTATCG CCGGCATGGC 
GTTCGCGAOG CGAGGCTGGA 
CCGGCGGCAT CGGGATGCCC 
GATGACGACC ATCAGGGACA 
CCTAACTTCG ATCACTGGAC 
CGGCGAGCAC ATGGAACGGG 
CTTGTCTGCC TCCCCGCGTT 
GACCTGAATG GAAGCCGGCG 
AATTGGAGCC AATCAATTCT 
CTTGGCAGAA CATATCCATC 
CGCATCTCGG GCAGCGTTGG 
CCTGTCGTTG AGGACCCGGC 
AGAATGAATC ACCGATACGC 
ACGTCTGCGA CCTGAGCAAC 
GTAAAGTCTG GAAACGCGGA 
TCTGCATCGC AGGATGCTGC 
TTAACGAAGC CTTTCTCAAT 



55 

CATTCCACGA GCCAGGTGAG 
TCCCCCATGC TTTTTGATGC 
GTCCACGCTC GGTGACGAAA 
AGAGGCCTGT CCTcGACCGA 
CCTTCCGGTG GGCGCGGGGC 
TTCTTTATCA TGCAACTCGT 
TTTCGGCGAG GACCGCTTTC 
TTGCGGTATT CGGAATCTTG 
CCCGCCACCA AACGTTTCGG 
GGCCGACGCG CTGGGCTACG 
TGGCCTTCCC CATTATGATT 
GCGTTGCAGG CCATGCTGTC 
GCTTCAAGGA TCGCTCGCGG 
CGCTGATCGT CACGGCGATT 
TTGGCATGGA TTGTAGGCGC 
GCGTCGCGGT GCATGGAGCC 
GCACCTCGCT AACGGATTCA 
TGCGGAGAAC TGTGAATGCG 
GCGTCCGCCA TCTCCAGCAG 
6TCCTGGCCA CGGGTGCGCA 
TAGGCTGGCG GGGTTGCCTT 
GAGCGAACGT GAAGCGACTG 
AACATGAATG GTCTTCGGTT 
AGTCAGCGCC CTGCACCATT 
TGGCTACCCT GTGGAACACC 
GCTCACGCTG TAGGTATCTC 



CTCTGGCCGT 


7550 


GTTTCTTACC 


7600 


AGGCTGTCCG 


7650 


TGCCCTTGAG 


7700 


ATGACTATCG 


7750 


AGGACAGGTG 


7800 


GCTGGAGCGC 


7850 


CACGCCCTCG 


7900 


CGAGAAGCAG 


7950 


TCTTGCTGGC 


8000 


CTTCTCGCTT 


8050 


CAGGCAGGTA 


8100 


CTCTTACCAG 


8150 


TATGCCGCCT 


8200 


CGCCCTATAC 


8250 


GGGCCACCTC 


8300 


CCACTCCAAG 


8350 


CAAACCAACC 


8400 


CCGCACGCGG 


8450 


TGATCGTGCT 


8500 


ACTGGTTAGC 


8550 


CTGCTGCAAA 


8600 


TCCGTGTTTC 


8650 


ATGTTCCGGA 


8700 


TACATCTGTA 


8750 


AGTTCGGTGT 


8800 
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A6GTC6TTC6 CTCCAA6CT6 
GACCGCTGCG CCTTATCCGG 
ACACGACTTA TCGCCACTGG 
CGAGGTATGT AGGCGGTGCT 
GGCTACACTA GAAGGACAGT 
TACCTTCGGA AAAAGAGTTG 
CTGGTAGCGG TGGTTTTTTT 
AAAGGATCTC AAGAAGATCC 
GTGGAACGAA AACTCACGTT 
GGATCTTCAC CTAGATCCTT 
TAAAGTATAT ATGAGTAAAC 
TGAGGCACCT ATCTCAGCGA 
GACTCCCCGT CGTGTAGATA 
CCCAGTGCTG CAATGATACC 
ATCAGCAATA AACCAGCCAG 
CAACTTTATC CGCCTCCATC 
GTAAGTAGTT CGCCAGTTAA 
AGGCATCGTG GTGTCACGCT 
GTTCCCAACG ATCAAGGCGA 
GCGGTTAGCT CCTTCGGTCC 
AGTGTTATCA CTCATGGTTA 
TGCCATCCGT AAGATGCTTT 
TTCTGAGAAT AGTGTATGCG 
ACGGGATAAT ACCGCGCCAC 
GAAAACGTTC TTCGGGGCGA 
TCCAGTTCGA TGTAACCCAC 
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GGCTGTGTGC ACGAACCCCC 
TAACTATCGT CTTGAGTCCA 
CAGCAGCCAC TGGTAACAGG 
ACAGAGTTCT TGAAbTGGTG 
ATTTGGTATC TGCGCTCTGC 
GTAGCTCTTG ATCCGGCAAA 
GTTTGCAAGC AGCAGATTAC 
TTTGATCTTT TCTACGGGGT 
AAGGGATTTT GGTCATGAGA 
TTAAATTAAA AATGAAGTTT 
TTGGTCTGAC AGTTACCAAT 
TCTGTCTATT TCGTTCATCC 
ACTACGATAC GGGAGGGCTT 
GCGAGACCCA CGCTCACCGG 
CCGGAAGGGC CGAGCGCAGA 
CAGTCTATTA ATTGTTGCCG 
TAGTTTGCGC AACGTTGTTG 
CGTCGTTTGG TATGGCTTCA 
GTTACATGAT CCCCCATGTT 
TCCGATCGTT GTCAGAAGTA 
TGGCAGCACT GCATAATTCT 
TCTGTGACTG GTGAGTACTC 
GCGACCGAGT TGCTCTTGCC 
ATAGCAGAAC TTTAAAAGTG 
AAACTCTCAA GGATCTTACC 
TCGTGCACCC AACTGATCTT 



CGTTCAGCCC 


8850 


ACCCGGTAAG 


8900 


ATTAGCAGAG 


8950 


GCCTAACTAC 


9000 


TGAAGCCAGT 


9050 


CAAACCACCG 


9100 


GCGCAGAAAA 


9150 


CTGACGCTCA 


9200 


TTATCAAAAA 


9250 


TAAATCAATC 


9300 


GCTTAATCAG 


9350 


ATAGTTGCCT 


9400 


ACCATCTGGC 


9450 


CTCCAGATTT 


9500 


AGTGGTCCTG 


9550 


GGAAGCTAGA 


9600 


CCATTGCTGC 


9650 


TTCAGCTCCG 


9700 


GTGCAAAAAA 


9750 


AGTTGGCCGC 


9800 


CTTACTGTCA 


9850 


AACCAAGTCA 


9900 


CGGCGTCAAC 


9950 


CTCATCATTG 


10000 


GCTGTTGAGA 


10050 


CAGCATCTTT 


10100 
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TACTTTCACC AGCGTTTCTG 
CAAAAAA6GG AATAAGGGC6 
CTTTTTCAAT ATTATTGAAG 
ATACATATTT GAATGTATTT 
CATTTCCCCG AAAAGTGCCA 
ACATTAACCT ATAAAAATAG 
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3GTGAGCAAA AACAGGAAGG 
ACACGGAAAT GTTGAATACT 
CATTTATCAG GGTTATTGTC 
AGAAAAATAA ACAAATAGGG 
CCTGACGTCT AAGAAACCAT 
GCGTATCACG AGGCCCTTTC 



CAAAATGCCG 10150 

CATACTCTTC 10200 

TCATGAGCGG 10250 

GTTCCGCGCA 10300 

TATTATCATG 10350 

GTCTTCAA 10398 



(2) INFORMATION FOR SEQ ID NO: 2: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4910 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



TCGCGCGTTT 


CGGTGATGAC 


GGTGAAAACC 


TCTGACACAT 


GCAGCTCCCG 


50 


GAGACGGTCA 


CAGCTTGTCT 


GTAAGCGGAT 


GCCGGGAGCA 


GACAAGCCCG 


100 


TCAGGGCGCG 


TCAGCGGGTG 


TTGGCGGGTG 


TCGGGGCTGG 


CTTAACTATG 


150 


CGGCATCAGA 


GCAGATTGTA 


CTGAGAGTGC 


ACCATATGCG 


GTGTGAAATA 


200 


CCGCACAGAT 


GCGTAAGGAG 


AAAATACCGC 


ATCAGGCGCC 


ATTCGCCATT 


250 


CAGGCTGCGC 


AACTGTTGGG 


AAGGGCGATC 


GGTGCGGGCC 


TCTTCGCTAT 


300 


TACGCCAGCT 


GGCGAAAGGG 


GGATGTGCTG 


CAAGGCGATT 


AAGTTGGGTA 


350 


ACGCCAGGGT 


TTTCCCAGTC 


ACGACGTTGT 


AAAACGACGG 


CCAGTGCCAA 


400 


GCTTGCATGC 


CTGCAGGTCG 


ACTCTAGAGG 


ATCCGAAAAA 


ACCTCCCACA 


450 


CCTCCCCCTG 


AACCTGAAAC 


ATAAAATGAA 


TGCAATTGTT 


GTTGTTAACT 


500 


TGTTTATTGC 


AGCTTATAAT 


GGTTACAAAT 


AAAGCAATAG 


CATCACAAAT 


550 


TTCACAAATA 


AAGCATTTTT 


TTCACTGCAT 


TCTAGTTGTG 


GTTTGTCCAA 


600 


ACTCATCAAT 


GTATCTTATC 


ATGTCTGGAT 


CCCCGCGGCC 


GCCAAATCAT 


650 
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TTATTGTTCA AAGATGCAGT CATCCAAATC 
CAGTGCAAGC GTCTGGCACC TTTCCCATGA 
TTCTGATACG CCTTTTTGAC GACAGAAACG 
AAAGCACTCT AAACAGTCTT TCTGTCCGTG 
TCTGATTCAT TCTCTCGCAT TGTCTGCAGG 
CCCACGTGAC GAGAACATTT GTTTTGGTAC 
AGCTTCCGCG TCTGACGTCG ATGGCTGCGC 
TGGGCTCACT TATATCTGCG TCACTGGGGG 
CCCTTTTTGA CGTAGAATTC ATGCTCCACC 
CCACCGGAAA AAGTCTTTGA CTTCCTGCTT 
GATCCAGACG GCGGGTGAGT TCAAATTTGA 
TGCTGGTGTT CGAAGGTCGT TGAGTTCCCG 
GGTGTTGGAG GTGACGATCA CGGGAGTCGG 
TGCATTTCTG GTCCACGCGC ACCTTGCTTC 
GACTCCACGA CCTTGGCGGT CATCTTCCCC 
CTTGTCGACA CAGTCGTTGA AGGGAAAGTT 
ACCCGTAGAA GGGCACAGTG TGGGCTATGG 
CCGGTAGTTG CAGGCCCAAA CAGCCAGATG 
TTTCGTGGCC CATCCCAGAA AGACGGAAGC 
CGTTTAGTTC CAAAATTTTA TAAATCCGAT 
GGCTGCTGGC CCACCAGGTA GTCGGGGGCG 
CTTTCCCGCA TTGTCCAAGG CAGCCTTGAT 
CCGCATTGAA GGAGATGTAT GAGGCCTGGT 
TCCGAGGTAA TCCCCTTGTC CACGAGCCAC 
GGCTGAAGTT TTTGATCTGA TCACCGGCGC 
TCTCTTTGTT CTGCTCCTGC GTCTGCGACA 



CACATTGACC 


AGATCGCAGG 


700 


TATGATGAAT 


GTAGCACAGT 


750 


GGTTGAGATT 


CTGACACGGG 


800 


AGTGhAGCAG 


ATATTTGAAT 


850 


GAAACAGCAT 


CAGATTCATG 


900 


CTGTCTGCGT 


AGTTGATCGA 


950 


AACTGACTCG 


CGCACCCGTT 


1000 


CGGGTCTTTT 


CTTGGCTCCA 


1050 


TCAACCACGT 


GATCCTTTGC 


1100 


GGTGACCTTC 


CCAAAGTCAT 


1150 


ACATCCGGTC 


TTGCAACGGC 


1200 


TCAATCACGG 


CGCACATGTT 


1250 


GTCTATCTGG 


GCCGAGGACT 


1300 


CTCCGAGAAT 


GGCTTTGGCC 


1350 


TCCTCCCACC 


AGATCACCAT 


1400 


CTCATTGGTC 


CAGTTTACGC 


1450 


CCTCCGCGAT 


GTTGGTCTTC 


1500 


GTGTTCCTCT 


TGCCGAACTT 


1550 


CGCATATTGG 


GGATCGTACC 


1600 


TGCTGGAAAT 


GTCCTCCACG 


1650 


GTTTTAGTCA 


GGCTCATAAT 


1700 


TTGGGACCGC 


GAGTTGGAGG 


1750 


CCTCCTGGAT 


CCACTGCTTC 


1800 


CCGACCAGCT 


CCATGTACCT 


1850 


ATCAGAATTG 


GGATTCTGAT 


1900 


CGTGCGTCAG 


ATGCTGCGCC 


1950 
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ACCAACCGTT 


TACGCTCCGT 


3AGATTCAAA 


CAGGCGCTTA 


AATACTGTTC 


2000 


CATATTAGTC 


CACGCCCACT 


GGAGCTCAGG 


CTGGGTTTTG 


GGGAGCAAGT 


2050 


AATTGGGGAT 


GTAGCACTCA 


TCCACCACCT 


TGTTCCCGCC 


TCCGGCGCCA 


2100 


TTTCTGGTCT 


TTGTGACCGC 


GAACCAGTTT 


GGCAAAGTCG 


GCTCGATCCC 


2150 


6CGGTAAATT 


CTCTGAATCA 


GTTTTTCGCG 


AATCTGACTC 


AGGAAACGTC 


2200 


CCAAAACCAT 


GGATTTCACC 


CCGGTGGTTT 


CCACGAGCAC 


GTGCATGTGG 


2250 


AA6TA6CTCT 


CTCCCTTCTC 


AAATTGCACA 


AAGAAAAGGG 


CCTCCGGGGC 


2300 


CTTACTCACA 


CGGCGCCATT 


CCGTCAGAAA 


GTCGCGCTGC 


AGCTTCTCGG 


2350 


CCAC66TCA6 


GGGTGCCTGC 


TCAATGAGAT 


TCAGATCCAT 


GTCAGAATCT 


2400 


G6C6GCAACT 


CCCATTCCTT 


CTCGGCCACC 


CAGTTCACAA 


AGCTGTCAGA 


2450 


AATGCC6GGC 


AGATGCCOGT 


CAAGGTCGCT 


GGGGACCTTA 


ATCACAATCT 


2500 


C6TAAAACCC 


CGGCATGGCG 


GCTGCGCGTT 


CAAACCTCCC 


GCTTCAAAAT 


2550 


6GAGACCCTG 


CGTGCTCACT 


CGGGCTTAAA 


TACCCAGCGT 


GACCACATGG 


2600 


T6T0GCAAAA 


TGTCGCAAAA 


CACTCACGTG 


ACCTCTAATA 


CAGGACTCTA 


2650 


GAG6ATCCCC 


GGGTACCGAG 


CTCGAATTCG 


TAATCATGGT 


CATAGCTGTT 


2700 


TCCTGTGTGA 


AATTGTTATC 


CGCTCACAAT 


TCCACACAAC 


ATACGAGCCG 


2750 


GAA6CATAAA 


GTGTAAAGCC 


TGGGGTGCCT 


AATGAGTGAG 


CTAACTCACA 


2800 


TTAATTGC6T 


TGCGCTCACT 


GCCCGCTTTC 


GAGTCGGGAA 


ACCTGTCGTG 


2850 


CCAGCTGCAT 


TAATGAATCG 


GCCAACGCGC 


GGGGAGAGGC 


GGTTTGOGTA 


2900 


TTGGGCGCTC 


TTCCGCTTCC 


TCGCTCACTG 


ACTCGCTGCG 


CTCGGTCGTT 


2950 


CGGCTGCGGC 


GAGCGGTATC 


AGCTCACTCA 


AAGGCGGTAA 


TACGGTTATC 


3000 


CACAGAATCA 


GGGGATAACG 


CAGGAAAGAA 


CATGTGAGCA 


AAAGGCCAGC 


3050 


AAAAGGCCAG 


GAACCGTAAA 


AAGGCCGCGT 


TGCTGGCGTT 


TTTCCATAGG 


3100 


CTCCGCCCCC 


CTGACGAGCA 


TCACAAAAAT 


CGACGCTCAA 


GTCAGAGGTG 


3150 


GCGAAACCCG 


ACAGGACTAT 


AAAGATACCA 


GGCGTTTCCC 


CCTGGAAGCT 


3200 


CCCTCGTGCG 


CTCTCCTGTT 


CCGACCCTGC 


CGCTTACCGG 


ATACCTGTCC 


3250 
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GCCTTTCTCC 


CTTCGGGAAG 


CGTGGCGCTT 


TCTCATAGCT 


CACGCTGTAG 


3300 


GTATCTCAGT 


TCGGTGTAGG 


TCGTTCGCTC 


CAAGCTGGGC 


TGTGTGCACG 


3350 


AACCCCCCGT 


TCAGCCCGAC 


CGCTGCGCCT 


TATCCGGTAA 


CTATCGTCTT 


3400 


GAGTCCAACC 


CGGTAAGACA 


CGACTTATCG 


CCAC1GGCAG 


CAGCGACTGG 


3450 


TAACAGGATT 


AGGAGAGCGA 


GGTATGTAGG 


CGGTGCTACA 


GAGTTCTTGA 


3500 


AGTGGTGGCC 


TAACTACGGC 


TACACTAGAA 


GGACAGTATT 


TGGTATCTGC 


3550 


GCTCTGCTGA 


AGCCAGTTAC 


CTTCGGAAAA 


AGAGTTGGTA 


GCTCTTGATC 


3600 


CGGCAAACAA 


ACCACCGCTG 


GTAGCGGTGG 


TTTTTTTGTT 


TGCAAGCAGC 


3650 


AGATTACGCG 


CAGAAAAAAA 


GGATCTCAAG 


AAGATCCTTT 


GATCTTTTCT 


3700 


ACGGGGTCTG 


ACGCTCAGTG 


GAACGAAAAC 


TCACGTTAAG 


GGATTTTGGT 


3750 


CATGAGATTA 


TCAAAAAGGA 


TCTTCACCTA 


GATCCTTTTA 


AATTAAAAAT 


3800 


GAAGTTTTAA 


ATCAATCTAA 


AGTATATATG 


AGTAAACTTG 


GTCTGACAGT 


3850 


TACCAATGCT 


TAATCAGTGA 


GGCACCTATC 


TCAGCGATCT 


GTCTATTTCG 


3900 


TTCATCCATA 


GTTGCCTGAC 


TCCCCGTCGT 


GTAGATAACT 


ACGATACGGG 


3950 


AGGGCTTACC 


ATCTGGCCCC 


AGTGCTGCAA 


TGATACCGCG 


AGACCCACGC 


4000 


TCACCGGCTC 


CAGATTTATC 


AGCAATAAAC 


CAGCCAGCCG 


GAAGGGCCGA 


4050 


GCGCAGAAGT 


GGTCCTGCAA 


CTTTATCCGC 


CTCCATCCAG 


TCTATTAATT 


4100 


GTTGCCGGGA 


AGCTAGAGTA 


AGTAGTTCGC 


CAGTTAATAG 


TTTGCGCAAC 


4150 


GTTGTTGCCA 


TTGCTACAGG 


CATCGTGGTG 


TCACGCTCGT 


CGTTTGGTAT 


4200 


GGCTTCATTC 


AGCTCCGGTT 


CCCAACGATC 


AAGGCGAGTT 


ACATGATCCC 


4250 


CCATGTTGTG 


CAAAAAAGCG 


GTTAGCTCCT 


TCGGTCCTCC 


GATCGTTGTC 


4300 


AGAAGTAAGT 


TGGCCGCAGT 


GTTATCACTC 


ATGGTTATGG 


CAGCACTGCA 


4350 


TAATTCTCTT 


ACTGTCATGC 


CATCCGTAAG 


ATGCTTTTCT 


GTGACTGGTG 


4400 


AGTACTCAAC 


CAAGTCATTC 


TGAGAATAGT 


GTATGCGGCG 


ACCGAGTTGC 


4450 


TCTTGCCCGG 


CGTCAATACG 


GGATAATACC 


GCGCCACATA 


GCAGAACTTT 


4500 


AAAAGTGCTC 


ATCATTGGAA 


AACGTTCTTC 


GGGGCGAAAA 


CTCTCAAGGA 


4550 
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TCTTACCGCT GTTGAGATCC 
TGATCTTCAG CATCTTTTAC 
AGGAAGGCAA AATGCCGCAA 
GAATACTCAT ACTCTTCCTT 
TATTGTCTCA TGAGCGGATA 
AATAGGGGTT CCGCGCACAT 
AAACCATTAT TATCATGACA 
CCCTTTCGTC 
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AGTTCGATGT AACCCACTCG 
TTTCACCAGC GTTTCTGGGT 
AAAAGGGAAT AAGGGCGACA 
TTTCAATATT ATTGAAGCAT 
CATATTTGAA TGTATTTAGA 
TTCCCCGAAA AGTGCCACCT 
TTAACCTATA AAAATAGGCG 



TGCACCCAAC 


4600 


GAGCAAAAAC 


4650 


CGGAAATGTT 


4700 


TTATCAGGGT 


4750 


AAAATAAACA 


4800 


GACGTCTAAG 


4850 


TATCACGAGG 


4900 




4910 
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WHAT IS CLAIMED IS. 

1. A recombinant hybrid virus comprising: 

(a) DNA sequences of , or corresponding 
to, the 5 1 inverted terminal repeat (ITR) sequences of an 
adenovirus and the 5' adenovirus packaging/ enhancer 
domain; 

(b) DNA sequences of, or corresponding 
to, the 5* adeno-associated virus (AAV) ITR sequences; 

(c) a gene encoding a selected protein 
operatively linked to regulatory sequences directing 
expression of the protein in a target cell in vivo or in 
vitro; 

(d) DNA sequences of, or corresponding 

to, the 3 1 AAV ITR sequences; 

(e) DNA sequences of, or corresponding 
to, the 3 1 adenovirus ITR sequences; 

wherein said virus is replication- 
defective and is provided with a sufficient portion of 
the genome of the adenovirus to permit infection of the 
target cell. 

2. The virus according to claim 1 wherein 
said adenovirus is rendered replication defective by a 
deletion in all or a part of the El gene, 

3. The virus according to claim 2 wherein 
said adenovirus genome has a deletion in all or a part of 
the E3 gene. 
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4. The virus according to claim 1 wherein 
said adenovirus genome comprising deletions in the DNA 
sequences of all or a portion of the adenovirus genes 
selected from the group consisting of the E2a gene, the 
E4 gene, the late genes LI through L5, the intermediate 
genes IX and IV a , and a combination thereof. 

5. The virus according to claim 1 wherein 
said selected gene is a reporter gene. 

6. The virus according to claim 5 wherein 
said reporter gene is selected from the group consisting 
of the genes encoding 6-galactosidase, alkaline 
phosphatase and green fluorescent protein. 

7. The virus according to claim 1 wherein 
said selected gene is a therapeutic gene. 

8. The virus according to claim 7 wherein 
said therapeutic gene is selected from the group 
consisting of a normal CFTR gene and a normal LDL gene. 

9. The virus according to claim 1 further 
comprising the DNA of, or corresponding to, a functional 
portion of the genome of an adeno-associated virus rep 
gene. 

10. A recombinant hybrid vector comprising: 

(a) DNA sequences of, or corresponding 
to, the 5 1 inverted terminal repeat (ITR) sequences of an 
adenovirus and the 5 1 adenovirus packaging/ enhancer 
domain; 

(b) DNA s quences of, or corresponding 
to, the 5 1 adeno-associat d virus (AAV) ITR sequences; 
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(c) a gene encoding a selected protein 
operatively linked to regulatory sequences directing 
expression of the protein in a target cell in vivo or in 
vitro j 

(d) DNA sequences of, or corresponding 
to, the 3' AAV ITR sequences; 

(e) DNA sequences of, or corresponding 
to, the 3« adenovirus ITR sequences; and 

(d) plasmid DNA sequences containing 
regulatory elements* 

11. A recombinant trans-infection particle 
comprising: 

(a) a recombinant hybrid virus comprising: 

(i) DNA sequences of, or corresponding 
to, the 5' inverted terminal repeat (ITR) sequences of an 
adenovirus and the 5' adenovirus packaging/ enhancer 
domain; 

(ii) DNA sequences of, or corresponding 
to, the 5 1 adeno-associated virus (AAV) ITR sequences; 

(iii) a gene encoding a selected protein 
operatively linked to regulatory sequences directing 
expression of the protein in a target cell in vivo or in 
vitro; 

(iv) DNA sequences of, or corresponding 
to , the 3 1 AAV ITR sequences ; 

(v) DNA sequences of, or corresponding 
to, the 3 f adenovirus ITR sequences; 

wherein said virus is replication- 
defective and is provided with a sufficient portion of 
the genome of the adenovirus to permit infection of the 
target cell; 

(b) a polycation sequence conjugated to said 
hybrid virus; and 
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(c) a plasmid comprising an AAV rep gene 
operatively linked to regulatory sequences capable of 
directing its expression, said plasmid associated with 
said polycation sequence. 

12. The trans-infection particle according to 
claim 11 wherein said adenovirus DNA lacks the sequence 
encoding viral genes. 

13. The trans-infection particle according to 
claim 11 wherein said adenovirus genome is rendered 
replication-defective by a deletion in all or a part of 
the El gene. 

14. The particle according to claim 13 wherein 
said adenovirus genome has a deletion in all or a part of 
the E3 gene. 

15. The particle according to claim 11 wherein 
said adenovirus genome has deletions in the DNA sequences 
of all or a portion of the adenovirus genes selected from 
the group consisting of the E2a gene, the E4 gene, the 
late genes LI through L5, the intermediate genes IX and 
IV a , and a combination thereof. 

16. The particle according to claim 11 wherein 
said selected gene is a reporter gene. 

17. The particle according to claim 16 wherein 
said reporter gene is selected from the group consisting 
of the genes encoding 6-galactosidase, alkaline 
phosphatase and gr en fluorescent protein. 

18. The particle according to claim 11 wherein 
said selected gene is a therapeutic gene. 
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19. The particle according to claim 18 wherein 
said therapeutic gene is selected from the group 
consisting of a normal CFTR gene and a normal LDL gene. 

20. A composition for use in delivering and 
stably integrating a selected gene into the chromosome of 
a target cell, said composition comprising 

(a) a recombinant hybrid virus 

comprising: 

(i) DNA sequences of, or 
corresponding to, the 5' inverted terminal repeat (ITR) 
sequences of an adenovirus and the 5' adenovirus 
packaging/ enhancer domain; 

(ii) DNA sequences of, or 
corresponding to, the 5' adeno-associated virus (AAV) ITR 
sequences ; 

(iii) a gene encoding a selected 
protein operatively linked to regulatory sequences 
directing expression of the protein in a target cell in 
vivo or in vitro; 

(iv) DNA sequences of, or 
corresponding to, the 3 f AAV ITR sequences; 

(v) DNA sequences of, or 
corresponding to, the 3 1 adenovirus ITR sequences; 

wherein said virus is replication- 
defective and is provided with a sufficient portion of 
the genome of the adenovirus to permit infection of the 
target cell; and 

(b) a pharmaceutically acceptable 

carrier. 

21. The composition according to claim 20 
further comprising a plasmid comprising an AAV rep gene 
under the control of regulatory sequences capable of 
expressing said rep gen in said targ t cell. 
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22. Th« composition according to claim 21 
wherein said vector further comprises the DNA of, or 
corresponding to, at least a functional portion of the 
genome of an adeno-associated virus rep gene. 

23. A composition for use in delivering and 
stably integrating a selected gene into the chromosome oi 
a target cell comprising an effective amount of a 
recombinant trans-infection particle in a 
pharmaceutically acceptable carrier, said particle 

comprising: 

(a) a recombinant hybrid virus comprising: 

(i) DNA sequences of, or corresponding 
to, the 5' inverted terminal repeat (ITR) sequences of ai 
adenovirus and the 5' adenovirus packaging/ enhancer 
domain; 

(ii) DNA sequences of, or corresponding 
to, the 5' adeno-associated virus (AAV) ITR sequences; 

(iii) a gene encoding a selected protein 
operatively linked to regulatory sequences directing 
expression of the protein in a target cell in vivo or in 
vitro; 

(iv) DNA sequences of, or corresponding 

to, the 3* AAV ITR sequences; 

(v) DNA sequences of, or corresponding 

to, the 3' adenovirus ITR sequences; 

wherein said virus is replication- 
defective and is provided with a sufficient portion of 
the genome of the adenovirus to permit infection of the 

target cell; 

(b) a polycation sequence conjugated to said 

hybrid virus; 
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(c) a plasmid comprising an AAV rep gene 
operatively linked to regulatory sequences capable of 
directing its expression, said plasmid associated with 
said polycation sequence. 

24. A mammalian cell capable of expressing a 
selected gene introduced therein through transduction of 
the virus of claim 1, the vector of claim 10, or the 
trans- infection particle of claim 11. 

25. A method for producing high levels of a 
recombinant adeno-associated virus comprising the steps 
of 

(a) culturing a cell co-trans fected with 
the vector of claim 10 and an optional helper virus in 
the presence of a plasmid containing an AAV rep gene 
under the control of regulatory sequences capable of 
expressing said rep gene; and 

(b) isolating from said culture a 

recombinant AAV. 

26. A method for producing high levels of a 
recombinant adeno-associated virus comprising the steps 
of 

(a) culturing a cell transfected with the 

particle of claim 11 and 

(b) isolating from said culture a 

recombinant AAV. 
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FIGURE 2A 



GAATTCGCTA GCATCATCAA TAATATACCT 
GGGGGTGGAG TTTGTGACGT GGCGCGGGGC 
GGCGGAAGTG TGATGTTGCA AGTGTGGCGG 
TGACGTTTTT GGTGTGCGCC GGTGTACACA 
GGATGTTGTA GTAAATTTGG GCGTAACCGA 
GAATAAGAGG AAGTGAAATC TGAATAATTT 
AGGGAGATCT GCTGCGCGCT CGCTCGCTCA 
GGGCGACCTT TGGTCGCCCG GCCTCAGTGA 
ACTCCATCAC TAGGGGTTCC TTGTAGTTAA 
TTCGAGCTTG CATGCCTGCA GGTCGTTACA 
CCGCCCAACG ACCCCCGCCC ATTGACGTCA 
ATAGGGACTT TCCATTGACG TCAATGGGTG 
GTACATCAAG TGTATCATAT GCCAAGTACG 
CCCGCCTGGC ATTATGCCCA GTACATGACC 
TACGTATTAG TCATCGCTAT TACCATGGTG 
GGATAGCGGT TTGACTCACG GGGATTTCCA 
TTGTTTTGGC ACCAAAATCA ACGGGACTTT 



TATTTTGGAT TGAAGCCAAT ATGATAATGA 

60 



GTGGGAACGG GGCGGGTGAC GTAGTAGTGT 

120 



AACACATGTA AGCGACGGAT GTGGCAAAAG 

180 



GGAAGTGACA ATTTTCGCGC GGTTTTAGGC 

240 



GTAAGATTTG GCCATTTTCG CGGGAAAACT 

300 



TGTGTTACTC ATAGCGCGTA ATATTTGTCT 

360 

CTGAGGCCGC CCGGGCAAAG CCCGGGCGTC 

420 



GCGAGCGAGC GCGCAGAGAG GGAGTGGCCA 

480 



TGATTAACCC GCCATGCTAC TTATCTACAA 

540 



TAACTTACGG TAAATGGCCC GCCTGGCTGA 

600 

ATAATGACGT ATGTTCCCAT AGTAACGCCA 

660 



GAGTATTTAC GGTAAACTGC CCACTTGGCA 

720 



CCCCCTATTG ACGTCAATGA CGGTAAATGG 

780 



TTATGGGACT TTCCTACTTG GCAGTACATC 

840 



ATGCGGTTTT GGCAGTACAT CAATGGGCGT 

900 



AGTCTCCACC CCATTGACGT CAATGGGAGT 

960 



CCAAAATGTC GTAACAACTC CGCCCCATTG 

1020 



WO 96/13598 

ACGCAAATGG GCGGTAGGCG 
AACCGTCAGA TCGCCTGGAG 
GACCGATCCA GCCTCCGGAC 
GTTAACTGGT AAGTTTAGTC 
CAAATCAAAG AACTGCTCCT 
TTACTTCTGC TCTAAAAGCT 
GAGCCTGCTA AAGCAAAAAA 
TTTTCGTTGC CGGTCTGGGA 
ATCCCGTCGT TTTACAACGT 
TTGCAGCACA TCCCCCTTTC 
CTTCCCAACA GTTGCGCAGC 
AAGCGGTGCC GGAAAGCTGG 
CCTCAAACTG GCAGATGCAC 
TTACGGTCAA TCCGCCGTTT 
TTAATGTTGA TGAAAGCTGG 
ACTCGGCGTT TCATCTGTGG 
TGCCGTCTGA ATTTGACCTG 
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FIGURE 2B 

TGTACGGTGG GAGGTCTATA 

ACGCCATCCA CGCTGTTTTG 

TCTAGAGGAT CCoGTACTCG 

TTTTTGTCTT TTATTTCAGG 

CAGTGGATGT TGCCTTTACT 

GCGGAATTGT ACCCGCGGCC 

GAAGTCACCA TGTCGTTTAC 

GGCATTGGTC TGGACACCAG 

CGTGACTGGG AAAACCCTGG 

GCCAGCTGGC GTAATAGCGA 

CTGAATGGCG AATGGCGCTT 

CTGGAGTGCG ATCTTCCTGA 

GGTTACGATG CGCCCATCTA 

GTTCCCACGG AGAATCCGAC 

CTACAGGAAG GCCAGACGCG 

TGCAACGGGC GCTGGGTCGG 

AGCGCATTTT TACGCGCCGG 



PCTAJS95/14018 



TAAGGAGAGC TCGTTTAGTG 

1080 

ACCTCCATAG AAGACACCGG 

1140 

AGGAACTGAA AAACCAGAAA 

1200 



TCCCGGATCC GGTGGTGGTG 

1260 



TCTAGGCCTG TACGGAAGTG 

1320 



GCAATTCCCG GGGATCGAAA 

1380 

TTTGACCAAC AAGAACGTGA 

1440 



CAAGGAGCTG CTCAAGCGCG 

1500 

CGTTACCCAA CTTAATCGCC 

1560 

AGAGGCCCGC ACCGATCGCC 

1620 

TGCCTGGTTT CCGGCACCAG 

1680 

GGCCGATACT GTCGTCGTCC 

1740 

CACCAACGTA ACCTATCCCA 

1800 

GGGTTGTTAC TCGCTCACAT 

1860 

AATTATTTTT GATGGCGTTA 

1920 

TTACGGCCAG GACAGTCGTT 

1980 

AGAAAACCGC CTCGCGGTGA 

2040 
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FIGURE 2C 



TGGTGCTGCG TTGGAGTGAC GGCAGTTATC 
GCATTTTCCG TGACGTCTCG TTGCTGCATA 
TTGCCACTCG CTTTAATGAT GATTTCAGCC 
GCGGCGAGTT GCGTGACTAC CTACGGGTAA 
TCGCCAGCGG CACCGCGCCT TTCGGCGGTG 
ATCGCGTCAC ACTACGTCTG AACGTCGAAA 
ATCTCTATCG TGCGGTGGTT GAACTGCACA 
CCTGCGATGT CGGTTTCCGC GAGGTGCGGA 
AGCCGTTGCT GATTCGAGGC GTTAACCGTC 
TGGATGAGCA GACGATGGTG CAGGATATCC 
TGCGCTGTTC GCATTATCCG AACCATCCGC 
TGTATGTGGT GGATGAAGCC AATATTGAAA 
CCGATGATCC GCGCTGGCTA CCGGCGATGA 
ATCGTAATCA CCCGAGTGTG ATCATCTGGT 
ATCACGACGC GCTGTATCGC TGGATCAAAT 
AAGGCGGCGG AGCCGACACC ACGGCCACCG 
ATGAAGACCA GCCCTTCCCG GCTGTGCCGA 



TGGAAGATCA GGATATGTGG CGGATGAGCG 

2100 

AACCGACTAC ACAAATCAGC GATTTCCATG 

2160 

GCGCTGTACT GGAGGCTGAA GTTCAGATGT 

2220 

CAGTTTCTTT ATGGCAGGGT GAAACGCAGG 

2280 

AAATTATCGA TGAGCGTGGT GGTTATGCCG 

2340 

ACCCGAAACT GTGGAGCGCC GAAATCCCGA 

2400 

CCGCCGACGG CACGCTGATT GAAGCAGAAG 

2460 

TTGAAAATGG TCTGCTGCTG CTGAACGGCA 

2520 

ACGAGCATCA TCCTCTGCAT GGTCAGGTCA 

2580 

TGCTGATGAA GCAGAACAAC TTTAACGCCG 

2640 

TGTGGTACAC GCTGTGCGAC CGCTACGGCC 

2700 

CCCACGGCAT GGTGCCAATG AATCGTCTGA 

2760 

GCGAACGCGT AACGCGAATG GTGCAGCGCG 

2820 

CGCTGGGGAA TGAATCAGGC CACGGCGCTA 

2880 

CTGTCGATCC TTCCCGCCCG GTGCAGTATG 

2940 

ATATTATTTG CCCGATGTAC GCGCGCGTGG 

3000 

AATGGTCCAT CAAAAAATGG CTTTCGCTAC 

3060 
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FIGURE 2D 



CTGGAGAGAC GCGCCCGCTG ATCCTTTGCG 
GCGGTTTCGC TAAATACTGG CAGGCGTTTC 
TCTGGGACTG GGTGGATCAG TCGCTGATTA 
CTTACGGCGG TGATTTTGGC GATACGCCGA 
TCTTTGCCGA CCGCACGCCG CATCCAGCGC 
TCCAGTTCCG TTTATCCGGG CAAACCATCG 
GCGATAACGA GCTCCTGCAC TGGATGGTGG 
AAGTGCCTCT GGATGTCGCT CCACAAGGTA 
AGCCGGAGAG CGCCGGGCAA CTCTGGCTCA 
CATGGTCAGA AGCCGGGCAC ATCAGCGCCT 
GTGTGACGCT CCCCGCCGCG TCCCACGCCA 
TTTGCATCGA GCTGGGTAAT AAGCGTTGGC 
AGATGTGGAT TGGCGATAAA AAACAACTGC 
CACCGCTGGA TAACGACATT GGCGTAAGTG 
TCGAACGCTG GAAGGCGGCG GGCCATTACC 
CAGATACACT TGCTGATGCG GTGCTGATTA 
AAACCTTATT TATCAGCCGG AAAACCTACC 



AATACGCCCA CGCGATGGGT AACAGTCTTG 

3120 



GTCAGTATCC CCGTTTACAG GGCGGCTTCG 

3180 

AATATGATGA AAACGGCAAC CCGTGGTCGG 

3240 

ACGATCGCCA GTTCTGTATG AACGGTCTGG 

3300 



TGACGGAAGC AAAACACCAG CAGCAGTTTT 

3360 

AAGTGACCAG CGAATACCTG TTCCGTCATA 

3420 

CGCTGGATGG TAAGCCGCTG GCAAGCGGTG 

3480 

AACAGTTGAT TGAACTGCCT GAACTACCGC 

3540 

CAGTACGCGT AGTGCAACCG AACGCGACCG 

3600 

GGCAGCAGTG GCGTCTGGCG GAAAACCTCA 

3660 

TCCCGCATCT GACCACCAGC GAAATGGATT 

3720 

AATTTAACCG CCAGTCAGGC TTTCTTTCAC 

3780 

TGACGCCGCT GCGCGATCAG TTCACCCGTG 

3840 

AAGCGACCCG CATTGACCCT AACGCCTGGG 

3900 

AGGCCGAAGC AGCGTTGTTG CAGTGCACGG 

3960 

CGACCGCTCA CGCGTGGCAG CATCAGGGGA 

4020 



GGATTGATGG TAGTGGTCAA ATGGCGATTA 

4080 
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FIGURE 2E 



CCGTTGATGT TGAAGTGGCG AGCGATACAC 
AGCTGGCGCA GGTAGCAGAG CGGGTAAACT 
CCGACCGCCT TACTGCCGCC TGTTTTGACC 
CCCCGTACGT CTTCCCGAGC GAAAACGGTC 
GCCCACACCA GTGGCGCGGC GACTTCCAGT 
TGATGGAAAC CAGCCATCGC CATCTGCTGC 
ACGGTTTCCA TATGGGGATT GGTGGCGACG 
TACAGCTGAG CGCCGGTCGC TACCATTACC 
CGGGCAGGCC ATGTCTGCCC GTATTTCGCG 
ACACAAACTT TTGGATGTTC GGTTTATTCT 
ACTTCCCGTT TTTCCCGATT TGGCTACATG 
GTATTATTTT TGCCGCTATT TCTCTGTTCT 
TTTCTGACAA ACTCGGCCTC GACTCTAGGC 
ATTGATGAGT TTGGACAAAC CACAACTAGA 
ATTTGTGATG CTATTGCTTT ATTTGTAACC 
AACAATTGCA TTCATTTTAT GTTTCAGGTT 
CCTCTAGAGT CGAGTAGATA AGTAGCATGG 



CGCATCCGGC GCGGATTGGC CTGAACTGCC 

4140 

GGCTCGGATT AGGGCCGCAA GAAAACTATC 

4200 

GCTGGGATCT GCCATTGTCA GACATGTATA 

4260 

TGCGCTGCGG GACGCGCGAA TTGAATTATG 

4320 



TCAACATCAG CCGCTACAGT CAACAGCAAC 

4380 

ACGCGGAAGA AGGCACATGG CTGAATATCG 

4440 

ACTCCTGGAG CCCGTCAGTA TCGGCGGAAT 

4500 

AGTTGGTCTG GTGTCAAAAA TAATAATAAC 

4560 



TAAGGAAATC CATTATGTAC TATTTAAAAA 

4620 

TTTTCTTTTA CTTTTTTATC ATGGGAGCCT 

4680 

ACATCAACCA TATCAGCAAA AGTGATACGG 

4740 

CGCTATTATT CCAACCGCTG TTTGGTCTGC 

4800 



GGCCGCGGGG ATCCAGACAT GATAAGATAC 

4860 

ATGCAGTGAA AAAAATGCTT TATTTGTGAA 

4920 

ATTATAAGCT GCAATAAACA AGTTAACAAC 

4980 

CAGGGGGAGG TGTGGGAGGT TTTTTCGGAT 

5040 



CGGGTTAATC ATTAACTACA AGGAACCCCT 

5100 



WO 96/13598 

AGTGATGGAG TTGGCCACTC 
AAAGGTCGCC CGACGCCCGG 
CAGATCTGGA AGGTGCTGAG 
GGCGGTAAAC ATATTAGGAA 
GATCACTTGG TGCTGGCCTG 
TGAGGTACTG AAATGTGTGG 
CTTATGTAGT TTTGTATCTG 
GATGGAAGCA TTGTGAGCTC 
CAGAATGTGA TGGGCTCCAG 
TTGACCTACG AGACCGTGTC 
GCCGCTGCAG CCACCGCCCG 
AGCAGTGCAG CTTCCCGTTC 
TTGGATTCTT TGACCCGGGA 
CAGGTTTCTG CCCTGAAGGC 
CCAGACTCTG TTTGGATTTG 
CGCGCGCGGT AGGCCCGGGA 
AGGACGTGGT AAAGGTGACT 
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FIGURE 2F 
CCTCTCTGCG CGCTCGCTCG 
GCTTTGCCCG GGCGGCCTCA 
GTACGATGAG ACCCGCACCA 
CCAGCCTGTG ATGCTGGATG 
CACCCGCGCT GAGTTTGGCT 
GCGTGGCTTA AGGGTGGGAA 
TTTTGCAGCA GCCGCCGCCG 
ATATTTGACA ACGCGCATGC 
CATTGATGGT CGCCCCGTCC 
TGGAACGCCG TTGGAGACTG 
CGGGATTGTG ACTGACTTTG 
ATCCGCCCGC GATGACAAGT 
ACTTAATGTC GTTTCTCAGC 
TTCCTCCCCT CCCAATGCGG 
GATCAAGCAA GTGTCTTGCT 
CCAGCGGTCT CGGTCGTTGA 
CTGGATGTTC AGATACATGG 
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CTCACTGAGG CCGGGCGACC 

5160 



GTGAGCGAGC GAGCGCGCAG 

5220 

GGTGCAGACC CTGCGAGTGT 

5280 



TGACCGAGGA GCTGAGGCCC 

5340 

CTAGCGATGA AGATACAGAT 

5400 

AGAATATATA AGGTGGGGGT 

5460 

CCATGAGCAC CAACTCGTTT 

5520 

CCCCATGGGC CGGGGTGCGT 

5580 

TGCCCGCAAA CTCTACTACC 

5640 



CAGCCTCCGC CGCCGCTTCA 

5700 

CTTTCCTGAG CCCGCTTGCA 

5760 

TGACGGCTCT TTTGGCACAA 

5820 

AGCTGTTGGA TCTGCGCCAG 

5880 

TTTAAAACAT AAATAAAAAA 

5940 

GTCTTTATTT AGGGGTTTTG 

6000 

GGGTCCTGTG TATTTTTTCC 

6060 

GCATAAGCCC GTCTCTGGGG 

6120 
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FIGURE 26 



TGGAGGTAGC ACCACTGCAG AGCTTCATGC 
TAGCAGGAGC GCTGGGCGTG GTGCCTAAAA 
GGCAGGCCCT TGGTGTAAGT GTTTACAAAG 
GATATGAGAT GCATCTTGGA CTGTATTTTT 
CGGGGATTCA TGTTGTGCAG AACCACCAGC 
TCATGTAGCT TAGAAGGAAA TGCGTGGAAG 
TTTTCCATGC ATTCGTCCAT AATGATGGCA 
ATATTTCTGG GATCACTAAC GTCATAGTTG 
TTTACAAAGC GCGGGCGGAG GGTGCCAGAC 
GCGTAGTTAC CCTCACAGAT TTGCATTTCC 
TCTACCTGCG GGGCGATGAA GAAAACGGTT 
AGCAGGTTCC TGAGCAGCTG CGACTTACCG 
ACCGGGTGCA ACTGGTAGTT AAGAGAGCTG 
ACTTCGTTAA GCATGTCCCT GACTCGCATG 
TCGCCGCCCA GCGATAGCAG TTCTTGCAAG 
TCCGCCGTAG GCATGCTTTT GAGCGTTTGA 
GTCACCTGCT CTACGGCATC TCGATCCAGC 



TGCGGGGTGG TGTTGTAGAT GATCCAGTCG 

6180 



ATGTCTTTCA GTAGCAAGCT GATTGCCAGG 

6240 



CGGTTAAGCT GGGATGGGTG CATACGTGGG 

6300 



AGGTTGGCTA TGTTCCCAGC CATATCCCTC 

6360 



ACAGTGTATC CGGTGCACTT GGGAAATTTG 

6420 



AACTTGGAGA CGCCCTTGTG ACCTCCAAGA 

6480 



ATGGGCCCAC GGGCGGCGGC CTGGGCGAAG 

6540 

TGTTCCAGGA TGAGATCGTC ATAGGCCATT 

6600 



TGCGGTATAA TGGTTCCATC CGGCCCAGGG 

6660 



CACGCTTTGA GTTCAGATGG GGGGATCATG 

6720 



TCCGGGGTAG GGGAGATCAG CTGGGAAGAA 

6780 

CAGCCGGTGG GCCCGTAAAT CACACCTATT 

6840 

CAGCTGCCGT CATCCCTGAG CAGGGGGGCC 

6900 



TTTTCCCTGA CCAAATCCGC CAGAAGGCGC 

6960 



GAAGCAAAGT TTTTCAACGG TTTGAGACCG 

7020 

CCAAGCAGTT CCAGGCGGTC CCACAGCTCG 

7080 

ATATCTCCTC GTTTCGCGGG TTGGGGCGGC 

7140 
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FIGURE 2H 



TTTCGCTGTA CGGCAGTAGT CGGTGCTCGT 
GGCGCAGGGT CCTCGTCAGC GTAGTCTGGG 
CGCTGGCCAG GGTGCGCTTG AGGCTGGTCC 
CCTGCGCGTC GGCCAGGTAG CATTTGACCA 
GGCCCTTGGC GCGCAGCTTG CCCTTGGAGG 
TGAGGGCGTA GAGCTTGGGC GCGAGAAATA 
AGGCCCCGCA GACGGTCTCG CATTCCACGA 
AAACCAGGTT TCCCCCATGC TTTTTGATGC 
GTCCACGCTC GGTGACGAAA AGGCTGTCCG 
CCTCGACCGA TGCCCTTGAG AGCCTTCAAC 
ATGACTATCG TCGCCGCACT TATGACTGTC 
CCGGCAGCGC TCTGGGTCAT TTTCGGCGAG 
GGCCTGTCGC TTGCGGTATT CGGAATCTTG 
CCCGCCACCA AACGTTTCGG CGAGAAGCAG 
CTGGGCTACG TCTTGCTGGC GTTCGCGACG 
CTTCTCGCTT CCGGCGGCAT CGGGATGCCC 
GATGACGACC ATCAGGGACA GCTTCAAGGA 



CCAGACGGGC CAGGGTCATG TCTTTCCACG 

7200 



TCACGGTGAA GGGGTGCGCT CCGGGCTGCG 

7260 



TGCTGGTGCT GAAGCGCTGC CGGTCTTCGC 

7320 



TGGTGTCATA GTCCAGCCCC TCCGCGGCGT 

7380 



AGGCGCCGCA CGAGGGGCAG TGCAGACTTT 

7440 



CCGATTCCGG GGAGTAGGCA TCCGCGCCGC 

7500 



GCCAGGTGAG CTCTGGCCGT TCGGGGTCAA 

7560 



GTTTCTTACC TCTGGTTTCC ATGAGCCGGT 

7620 



TGTCCCCGTA TACAGACTTG AGAGGCCTGT 

7680 



CCAGTCAGCT CCTTCCGGTG GGCGCGGGGC 

7740 



TTCTTTATCA TGCAACTCGT AGGACAGGTG 

7800 



GACCGCTTTC GCTGGAGCGC GACGATGATC 

7860 



CACGCCCTCG CTCAAGCCTT CGTCACTGGT 

7920 



GCCATTATCG CCGGCATGGC GGCCGACGCG 

7980 

CGAGGCTGGA TGGCCTTCCC CATTATGATT 

8040 



GCGTTGCAGG CCATGCTGTC CAGGCAGGTA 

8100 



TCGCTCGCGG CTCTTACCAG CCTAACTTCG 

8160 



WO 96/13598 

ATCACTGGAC CGCTGATCGT 
TTGGCAT6GA TTGTAGGCGC 
GCATGGAGCC GGGCCACCTC 
CCACTCCAAG AATTGGAGCC 
CTTGGCAGAA CATATCCATC 
GCAGCGTTGG GTCCTGGCCA 
TAGGCTGGCG GGGTTGCCTT 
GAAGCGACTG CTGCTGCAAA 
TCCGTGTTTC GTAAAGTCTG 
TCTGCATCGC AGGATGCTGC 
CTTTCTCAAT GCTCACGCTG 
GGCTGTGTGC ACGAACCCCC 
CTTGAGTCCA ACCCGGTAAG 
ATTAGCAGAG CGAGGTATGT 
GGCTACACTA GAAGGACAGT 
AAAAGAGTTG GTAGCTCTTG 
GTTTGCAAGC AGCAGATTAC 
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FIGURE 21 
CACGGCGATT TATGCCGCCT 
CGCCCTATAC CTTGTCTGCC 
GACCTGAATG GAAGCCGGCG 
AATCAATTCT TGCGGAGAAC 
GCGTCCGCCA TCTCCAGCAG 
CGGGTGCGCA TGATCGTGCT 
ACTGGTTAGC AGAATGAATC 
ACGTCTGCGA CCTGAGCAAC 
GAAACGCGGA AGTCAGCGCC 
TGGCTACCCT GTGGAACACC 
TAGGTATCTC AGTTCGGTGT 
CGTTCAGCCC GACCGCTGCG 
ACACGACTTA TCGCCACTGG 
AGGCGGTGCT ACAGAGTTCT 
ATTTGGTATC TGCGCTCTGC 
ATCCGGCAAA CAAACCACCG 
GCGCAGAAAA AAAGGATCTC 
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CGGCGAGCAC ATGGAACGGG 

8220 



TCCCCGCGTT GCGTCGCGGT 

8280 



GCACCTCGCT AACGGATTCA 

8340 



TGTGAATGCG CAAACCAACC 

8400 



CCGCACGCGG CGCATCTCGG 

8460 



CCTGTCGTTG AGGACCCGGC 

8520 

ACCGATACGC GAGCGAACGT 

8580 

AACATGAATG GTCTTCGGTT 

8640 



CTGCACCATT ATGTTCCGGA 

8700 

TACATCTGTA TTAACGAAGC 

8760 



AGGTCGTTCG CTCCAAGCTG 

8820 



CCTTATCCGG TAACTATCGT 

8880 

CAGCAGCCAC TGGTAACAGG 

8940 



TGAAGTGGTG GCCTAACTAC 

9000 

TGAAGCCAGT TACCTTCGGA 

9060 

CTGGTAGCGG TGGTTTTTTT 

9120 

AAGAAGATCC TTTGATCTTT 

9180 



WO 96/13598 

TCTAC6GG6T CTGACGCTCA 
TTATCAAAAA GGATCTTCAC 
TAAAGTATAT ATGAGTAAAC 
ATCTCAGCGA TCTGTCTATT 
ACTACGATAC GGGAGGGCTT 
CGCTCACCGG CTCCAGATTT 
AGTGGTCCTG CAACTTTATC 
GTAAGTAGTT CGCCAGTTAA 
GTGTCACGCT CGTCGTTTGG 
GTTACATGAT CCCCCATGTT 
GTCAGAAGTA AGTTGGCCGC 
CTTACTGTCA TGCCATCCGT 
TTCTGAGAAT AGTGTATGCG 
ACCGCGCCAC ATAGCAGAAC 
AAACTCTCAA GGATCTTACC 
AACTGATCTT CAGCATCTTT 
CAAAATGCCG CAAAAAAGGG 
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FIGURE 2J 
GTGGAACGAA AACTCACGTT 
CTAGATCCTT TTAAATTAAA 
TTGGTCTGAC AGTTACCAAT 
TCGTTCATCC ATAGTTGCCT 
ACCATCTGGC CCCAGTGCTG 
ATCAGCAATA AACCAGCCAG 
CGCCTCCATC CAGTCTATTA 
TAGTTTGCGC AACGTTGTTG 
TATGGCTTCA TTCAGCTCCG 
GTGCAAAAAA GCGGTTAGCT 
AGTGTTATCA CTCATGGTTA 
AAGATGCTTT TCTGTGACTG 
GCGACCGAGT TGCTCTTGCC 
TTTAAAAGTG CTCATCATTG 
GCTGTTGAGA TCCAGTTCGA 
TACTTTCACC AGCGTTTCTG 
AATAAGGGCG ACACGGAAAT 
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AAGGGATTTT GGTCATGAGA 

9240 

AATGAAGTTT TAAATCAATC 

9300 

GCTTAATCAG TGAGGCACCT 

9360 

GACTCCCCGT CGTGTAGATA 

9420 

CAATGATACC GCGAGACCCA 

9480 

CCGGAAGGGC CGAGCGCAGA 

9540 



ATTGTTGCCG GGAAGCTAGA 

9600 

CCATTGCTGC AGGCATCGTG 

9660 

GTTCCCAACG ATCAAGGCGA 

9720 

CCTTCGGTCC TCCGATCGTT 

9780 

TGGCAGCACT GCATAATTCT 

9840 

GTGAGTACTC AACCAAGTCA 

9900 

CGGCGTCAAC ACGGGATAAT 

9960 

GAAAACGTTC TTCGGGGCGA 

10020 

TGTAACCCAC TCGTGCACCC 

10080 

GGTGAGCAAA AACAGGAAGG 

10140 

GTTGAATACT CATACTCTTC 

10200 
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FIGURE 2K 

CTTTTTCAAT ATTATTGAAG CATTTATCAG GGTTATTGTC TCATGAGCGG ATACATATTT 

X02oO 

GAATGTATTT AGAAAAATAA ACAAATAGGG GTTCCGCGCA CATTTCCCCG AAAAGTGCCA 
CCTGACGTCT AAGAAACCAT TATTATCATG ACATTAACCT ATAAAAATAG GCGTATCACG 

AGGCCCTTTC GTCTTCAA 
10398 



WO 96/13598 
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FIG. 3 
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FIGURE 5A 



TCGCGCGTTT CGGTGATGAC GGTGAAAACC 
CAGCTTGTCT GTAAGCGGAT GCCGGGAGCA 
TTGGCGGGTG TCGGGGCTGG CTTAACTATG 
ACCATATGCG GTGTGAAATA CCGCACAGAT 
ATTCGCCATT CAGGCTGCGC AACTGTTGGG 
TACGCCAGCT GGCGAAAGGG GGATGTGCTG 
TTTCCCAGTC ACGACGTTGT AAAACGACGG 
ACTCTAGAGG ATCCGAAAAA ACCTCCCACA 
TGCAATTGTT GTTGTTAACT TGTTTATTGC 
CATCACAAAT TTCACAAATA AAGCATTTTT 
ACTCATCAAT GTATCTTATC ATGTCTGGAT 
AAGATGCAGT CATCCAAATC CACATTGACC 
TTTCCCATGA TATGATGAAT GTAGCACAGT 
GGTTGAGATT CTGACACGGG AAAGCACTCT 
ATATTTGAAT TCTGATTCAT TCTCTCGCAT 
CCCACGTGAC GAGAACATTT GTTTTGGTAC 
TCTGACGTCG ATGGCTGCGC AACTGACTCG 



TCTGACACAT GCAGCTCCCG GAGACGGTCA 

60 

GACAAGCCCG TCAGGGCGCG TCAGCGGGTG 

120 

CGGCATCAGA GCAGATTGTA CTGAGAGTGC 

180 



GCGTAAGGAG AAAATACCGC ATCAGGCGCC 

240 

AAGGGCGATC GGTGCGGGCC TCTTCGCTAT 

300 



CAAGGCGATT AAGTTGGGTA ACGCCAGGGT 

360 

CCAGTGCCAA GCTTGCATGC CTGCAGGTCG 

420 



CCTCCCCCTG AACCTGAAAC ATAAAATGAA 

480 

AGCTTATAAT GGTTACAAAT AAAGCAATAG 

540 

TTCACTGCAT TCTAGTTGTG GTTTGTCCAA 

600 

CCCCGCGGCC GCCAAATCAT TTATTGTTCA 

660 

AGATCGCAGG CAGTGCAAGC GTCTGGCACC 

720 

TTCTGATACG CCTTTTTGAC GACAGAAACG 

780 



AAACAGTCTT TCTGTCCGTG AGTGAAGCAG 

840 

TGTCTGCAGG GAAACAGCAT CAGATTCATG 

900 

CTGTCTGCGT AGTTGATCGA AGCTTCCGCG 

960 

CGCACCCGTT TGGGCTCACT TATATCTGCG 

1020 
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FIGURE 5B 



TCACTGGGGG CGGGTCTTTT CTTGGCTCCA CCCTTTTTGA CGTAGAATTC ATGCTCCACC 

1080 

TCAACCACGT GATCCTTTGC CCACCGGAAA AAGTCTTTGA CTTCCTGCTT GGTGACCTTC 

1140 



CCAAAGTCAT GATCCAGACG GCGGGTGAGT TCAAATTTGA ACATCCGGTC TTGCAACGGC 

1200 

TGCTGGTGTT CGAAGGTCGT TGAGTTCCCG TCAATCACGG CGCACATGTT GGTGTTGGAG 

1260 



GTGACGATCA CGGGAGTCGG GTCTATCTGG GCCGAGGACT TGCATTTCTG GTCCACGCGC 

1320 



ACCTTGCTTC CTCCGAGAAT GGCTTTGGCC GACTCCACGA CCTTGGCGGT CATCTTCCCC 

1380 

TCCTCCCACC AGATCACCAT CTTGTCGACA CAGTCGTTGA AGGGAAAGTT CTCATTGGTC 

1440 



CAGTTTACGC ACCCGTAGAA GGGCACAGTG TGGGCTATGG CCTCCGCGAT GTTGGTCTTC 

1500 

CCGGTAGTTG CAGGCCCAAA CAGCCAGATG GTGTTCCTCT TGCCGAACTT TTTCGTGGCC 

1560 

CATCCCAGAA AGACGGAAGC CGCATATTGG GGATCGTACC CGTTTAGTTC CAAAATTTTA 

1620 

TAAATCCGAT TGCTGGAAAT GTCCTCCACG GGCTGCTGGC CCACCAGGTA GTCGGGGGCG 

1680 

GTTTTAGTCA GGCTCATAAT CTTTCCCGCA TTGTCCAAGG CAGCCTTGAT TTGGGACCGC 

1740 

GAGTTGGAGG CCGCATTGAA GGAGATGTAT GAGGCCTGGT CCTCCTGGAT CCACTGCTTC 

1800 



TCCGAGGTAA TCCCCTTGTC CACGAGCCAC CCGACCAGCT CCATGTACCT GGCTGAAGTT 

1860 



TTTGATCTGA TCACCGGCGC ATCAGAATTG GGATTCTGAT TCTCTTTGTT CTGCTCCTGC 

1920 

GTCTGCGACA CGTGCGTCAG ATGCTGCGCC ACCAACCGTT TACGCTCCGT GAGATTCAAA 

1980 

CAGGCGCTTA AATACTGTTC CATATTAGTC CACGCCCACT GGAGCTCAGG CTGGGTTTTG 

2040 
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FIGURE 5C 



GGGAGCAAGT AATTGGGGAT GTAGCACTCA TCCACCACCT TGTTCCCGCC TCCGGCGCCA 

2100 

TTTCTGGTCT TTGTGACCGC GAACCAGTTT GGCAAAGTCG GCTCGATCCC GCGGTAAATT 

2160 

CTCTGAATCA GTTTTTCGCG AATCTGACTC AGGAAACGTC CCAAAACCAT GGATTTCACC 

2220 

CCGGTGGTTT CCACGAGCAC GTGCATGTGG AAGTAGCTCT CTCCCTTCTC AAATTGCACA 

2280 

AAGAAAAGGG CCTCCGGGGC CTTACTCACA CGGCGCCATT CCGTCAGAAA GTCGCGCTGC 

2340 

AGCTTCTCGG CCACGGTCAG GGGTGCCTGC TCAATCAGAT TCAGATCCAT GTCAGAATCT 

2400 

GGCGGCAACT CCCATTCCTT CTCGGCCACC CAGTTCACAA AGCTGTCAGA AATGCCGGGC 

2460 

AGATGCCCGT CAAGGTCGCT GGGGACCTTA ATCACAATCT CGTAAAACCC CGGCATGGCG 

2520 

GCTGCGCGTT CAAACCTCCC GCTTCAAAAT GGAGACCCTG CGTGCTCACT CGGGCTTAAA 

2580 

TACCCAGCGT GACCACATGG TGTCGCAAAA TGTCGCAAAA CACTCACGTG ACCTCTAATA 

2640 

CAGGACTCTA GAGGATCCCC GGGTACCGAG CTCGAATTCG TAATCATGGT CATAGCTGTT 

2700 

TCCTGTGTGA AATTGTTATC CGCTCACAAT TCCACACAAC ATACGAGCCG GAAGCATAAA 

2760 

GTGTAAAGCC TGGGGTGCCT AATGAGTGAG CTAACTCACA TTAATTGCGT TGCGCTCACT 

2820 

GCCCGCTTTC CAGTCGGGAA ACCTGTCGTG CCAGCTGCAT TAATGAATCG GCCAACGCGC 

2880 

GGGGAGAGGC GGTTTGCGTA TTGGGCGCTC TTCCGCTTCC TCGCTCACTG ACTCGCTGCG 

2940 

CTCGGTCGTT CGGCTGCGGC GAGCGGTATC AGCTCACTCA AAGGCGGTAA TACGGTTATC 

3000 

CACAGAATCA GGGGATAACG CAGGAAAGAA CATGTGAGCA AAAGGCCAGC AAAAGGCCAG 

3060 
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6AACC6TAAA AAGGCCGCGT 
TCACAAAAAT CGACGCTCAA 
GGCGTTTCCC CCTGGAAGCT 
ATACCTGTCC GCCTTTCTCC 
GTATCTCAGT TCGGTGTAGG 
TCAGCCCGAC CGCTGCGCCT 
CGACTTATCG CCACTGGCAG 
CGGTGCTACA GAGTTCTTGA 
TGGTATCTGC GCTCTGCTGA 
CGGCAAACAA ACCACCGCTG 
CAGAAAAAAA GGATCTCAAG 
GAACGAAAAC TCACGTTAAG 
GATCCTTTTA AATTAAAAAT 
GTCTGACAGT TACCAATGCT 
TTCATCCATA GTTGCCTGAC 
ATCTGGCCCC AGTGCTGCAA 
AGCAATAAAC CAGCCAGCCG 
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FIGURE 5D 

TGCTGGCGTT TTTCCATAGG 

GTCAGAGGTG GCGAAACCCG 

CCCTCGTGCG CTCTCCTGTT 

CTTCGGGAAG CGTGGCGCTT 

TCGTTCGCTC CAAGCTGGGC 

TATCCGGTAA CTATCGTCTT 

CAGCCACTGG TAACAGGATT 

AGTGGTGGCC TAACTACGGC 

AGCCAGTTAC CTTCGGAAAA 

GTAGCGGTGG TTTTTTTGTT 

AAGATCCTTT GATCTTTTCT 

GGATTTTGGT CATGAGATTA 

GAAGTTTTAA ATCAATCTAA 

TAATCAGTGA GGCACCTATC 

TCCCCGTCGT GTAGATAACT 

TGATACCGCG AGACCCACGC 

GAAGGGCCGA GCGCAGAAGT 
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CTCCGCCCCC CTGACGAGCA 

3120 

ACAGGACTAT AAAGATACCA 

3180 



CCGACCCTGC CGCTTACCGG 

3240 

TCTCATAGCT CACGCTGTAG 

3300 

TGTGTGCACG AACCCCCCGT 

3360 

GAGTCCAACC CGGTAAGACA 

3420 

AGCAGAGCGA GGTATGTAGG 

3480 

TACACTAGAA GGACAGTATT 

3540 



AGAGTTGGTA GCTCTTGATC 

3600 

TGCAAGCAGC AGATTACGCG 

3660 

ACGGGGTCTG ACGCTCAGTG 

3720 

TCAAAAAGGA TCTTCACCTA 

3780 

AGTATATATG AGTAAACTTG 

3840 

TCAGCGATCT GTCTATTTCG 

3900 

ACGATACGGG AGGGCTTACC 

3960 

TCACCGGCTC CAGATTTATC 

4020 

GGTCCTGCAA CTTTATCCGC 

4080 
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CTCCATCCAG TCTATTAATT 
TTTGCGCAAC 6TTGTTGCCA 
GGCTTCATTC AGCTCCGGTT 
CAAAAAAGCG GTTAGCTCCT 
GTTATCACTC ATGGTTATGG 
ATGCTTTTCT GTGACTGGTG 
ACCGAGTTGC TCTTGCCCGG 
AAAAGTGCTC ATCATTGGAA 
GTTGAGATCC AGTTCGATGT 
TTTCACCAGC GTTTCTGGGT 
AAGGGCGACA CGGAAATGTT 
TTATCAGGGT TATTGTCTCA 
AATAGGGGTT CCGCGCACAT 
TATCATGACA TTAACCTATA 
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FIGURE 5E 
GTTGCCGGGA AGCTAGAGTA 
TTGCTACAGG CATCGTGGTG 
CCCAACGATC AAGGCGAGTT 
TCGGTCCTCC GATCGTTGTC 
CAGCACTGCA TAATTCTCTT 
AGTACTCAAC CAAGTCATTC 
CGTCAATACG GGATAATACC 
AACGTTCTTC GGGGCGAAAA 
AACCCACTCG TGCACCCAAC 
GAGCAAAAAC AGGAAGGCAA 
GAATACTCAT ACTCTTCCTT 
TGAGCGGATA CATATTTGAA 
TTCCCCGAAA AGTGCCACCT 
AAAATAGGCG TATCACGAGG 
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AGTAGTTCGC CAGTTAATAG 

4140 

TCACGCTCGT CGTTTGGTAT 

4200 

ACATGATCCC CCATGTTGTG 

4260 



AGAAGTAAGT TGGCCGCAGT 

4320 



ACTGTCATGC CATCCGTAAG 

4380 

TGAGAATAGT GTATGCGGCG 

4440 

GCGCCACATA GCAGAACTTT 

4500 

CTCTCAAGGA TCTTACCGCT 

4560 

TGATCTTCAG CATCTTTTAC 

4620 

AATGCCGCAA AAAAGGGAAT 

4680 



TTTCAATATT ATTGAAGCAT 

4740 



TGTATTTAGA AAAATAAACA 

4800 

GACGTCTAAG AAACCATTAT 

4860 



CCCTTTCGTC 
4910 
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